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Description 

Methods For Producing Soluble, Biologically-Active Disulfide Bond- - 
Containing Eukaryotic Proteins In Bacterial Cells 

1. Background Of The Invention 

The present application is a continuing application J^fced on U. S. Provisional 
Patent Application Serial Number 60/014,950, filed April 5, 1996, the entire content of 
which is specifically incorporated herein by reference. The United States government 
has certain rights in the present invention pursuant to Grant 1R01-GM47520-01 Al from 
the National Institutes of Health. 

1.1 Field of the Invention 

The present invention relates generally to the field of molecular biology. More 
particularly, certain embodiments concern methods and compositions related to improved 
methods of producing biologically-active, soluble eukaryotic disulfide bond-containing 
eukaryotic polypeptides in bacterial cells. In preferred embodiments eukaryotic proteins 
such as tissue plasminogen activator (tPA) and pancreatic trypsin inhibitor (PTI) are 
produced in Escherichia coli cells using recombinant vectors which direct the cocxpression 
of these proteins with the eukaryotic enzyme, protein disulfide isomerase (PDI). 

1.2 Description Of The Related Art 

1.2.1 Protein Expression in Bacterial Hosts 

A significant achievement in molecular biology has been the use of recombinant 
bacterial cells to produce eukaryotic proteins. This method has been particularly useful for 
production of medically important polypeptides that are obtained in low yield from natural 
sources. Often otherwise difficult to obtain in quantity, such proteins are "overexpressed" 
in the host cell and subsequently isolated and purified. Preinsulin for example may be 
produced in a recombinant prokaryotic microorganism carrying DNA encoding rat 
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preinsulin (U. S. Patents 4,431,740 and 4.652,525, specifically incorporated herein by 
reference). 

Expression of multiple disulfide bond-containing eukaryotic polypeptides, and 
particularly mammalian proteins, in bacterial cells has frequently produced disappointing 
and unsatisfactory results because conditions and environment in the host cells were not 
conducive to correct folding. Disulfide bond formation is a pa^cess mainly restricted to 
proteins outside the cytoplasmic compartment such as those secreted into the lumen of the 
endoplasmic reticulum (ER) or the periplasm of gram negative bacteria. Correct folding 
may depend on the formation of cysteine-cysteine linkages and subsequent stabilization of 
the protein into an enzymatically active structure. However, the cytoplasm is in fact a 
reducing environment due to the presence of thioredoxin reductase or reduced glutathione, 
thus blocking oxidation so that disulfide bonds do not form. The endoplasmic reticulum 
(ER) apparently is more conducive to oxidation due to the presence of oxygen or oxidized 
glutathione. 

Recent studies indicate that disulfide bond formation in vivo is a catalyzed process, 
whether in the ER or periplasm. In E. coh\ a pathway for the formation of disulfide bonds 
in secreted proteins has been described, involving two proteins, DsbA and DsbB (Bardwell 
et al„ 1 993; Missiakas et al % 1 993). 

A role for these Dsb proteins is supported by the observation that mutants of E. 
coli that lack DsbA or DsbB are defective with Fespect to disulfide bond formation 
(Dailey and Berg, 1993). In the yeast Saccharomyces cerevisiae y a similar defect is 
found in certain mutants defective in protein disulfide isemerase (PDI) gene. Disulfide 
bond formation in carboxypeptidase Y in these mutants is impaired. 

1.2.2 Expression of Eukaryotic Proteins in Bacterial Hosts 

It is known that disulfide bonds are critical in some proteins in order for proper 
folding and even in transport and secretion. Yet many proteins cannot be efficiently 
expressed in bacterial hosts due to failure of disulfide bond formation. Cytoplasmic 
expression systems in bacteria are not conducive to disulfide bond formation because of 
a reducing environment. The presence of proteases in the cytoplasm may cause rapid 
degradation of the protein, resulting in low yields. 
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Most exported proteins contain disulfide bonds which confer increased 
thermodynamic stability to the folded polypeptide chain. The sequence of events 
involved in cysteine oxidation and correct pairing to form native disulfide bonds is a 
critical step in protein folding. Due to constraints related to the reactivity or structural 
accessibility of cysteine thiols in proteins, disulfide bonds often form very slowly. A 
complex cellular machinery, whose components and mode of action are only now 
beginning to be understood, has evolved to catalyze thesC processes in vivo. In 
Gram-negative bacteria such as E. coli, the cytoplasm is highly reducing and therefore 
disulfide formation normally occurs after a polypeptide chain has been translocated 
across the inner membrane (Wiilfing and Pluckthun, 1994; Bardwell, 1994). Genetic 
analysis has identified at least six genes coding for cell envelope proteins that play a role 
in disulfide bond formation. Four of these proteins have been characterized in some 
detail (Bardwell, 1994; Missiakas et al, 1995). DsbA is a 21.5 kDa enzyme having a 
thioredoxin-like subdomain with an extremely reactive and highly oxidizing disulfide 
15 bond but poor disulfide isomerization activity (Bardwell et al, 1991; Kamitani et al, 

1992; Zapun el al., 1993; Wunderlich and Glockshuber, 1993a; 1993b; Joly and Swartz, 
1994). DsbB is a cytoplasmic membrane protein which is required for the reoxidation of 
DsbA (Guilhot et al., 1995; Bardwell et al, 1993; Missiakas et al, 1993; Dailey and 
Berg, 1993). DsbC is another soluble cysteine oxidoreductase and has much higher 
disulfide isomerase activity than DsbA (Bardwell, 1 994; Missiakas et al, 1 994). Finally, 
the recently discovered DsbD is an inner membrane protein which has been proposed to 
function as a reducing source in the periplasm and to be required for maintaining proper 
redox conditions (Missiakas et al, 1995). 

Bacterial proteins become oxidized and fold rapidly soon after export from the 
25 cytoplasm. However, the formation of native disulfide bonds in heterologous proteins 

with multiple cysteines is often very inefficient (Wunderlich and Glockshuber, 1993a; 
1993b; De Sutter et al, 1992). Partially folded" molecules are highly susceptible to 
degradation, thus resulting in very low yields (Wiilfing and Pluckthun, 1994). The 
shortcomings of the disulfide bond formation machinery of E. coli with respect to 
eukaryotic proteins have been illuminated by analyzing the folding pathway of the 
Bovine PTI (BPTI) expressed in the periplasmic space (Ostermeier and Georgiou, 1994). 
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BPTI is a 6.5-kDa protease inhibitor with three disulfide bonds. In E. coli, just as in 
vitro, the rate limiting step in folding is the isomerization of two disulfide intermediates. 
The bacterial periplasmic space is thought to be strongly oxidizing (Wunderlich and 
Glockshuber, 1993a; 1993b; Walker and Gilbert, 1994; Kishigami et at., 1995) and 
appears to lack sufficient disulfide isomerase activity required for the folding of 
heterologous multi-disulfide proteins. In sharp contrast to^he bacterial periplasm, 
disulfide bond formation in eukaryotes occurs in the endoplasmic reticulum, a 
compartment which is maintained at relatively reducing conditions (Hwang et al* 1992). 
Disulfide bond formation and isomerization in the ER is catalyzed by PDI, an abundant 
55-kDa enzyme which apart from its thioredoxin-like active site shares little homology 
with prokaryotic proteins. PDI contains two active sites that are not functionally 
equivalent, has been shown to both promote and inhibit protein aggregation, and can 
exist in different oligomerization states (Freedman et al^ 1994; Lyles and Gilbert, 1994; 
Puig and Gilbert, 1994; Puig et aU 1 994). 

1 .2.3 Current Methods of Producing Eukaryotic Proteins are Inefficient 

Expression enhancers for increasing yield of eukaryotic proteins expressed in E. 
coli cells have been reported (U. S. Patent 5,336,602). The expression enhancer is 
simultaneously expressed with a protein of interest where the rate of,expression is shown 
to increase by comparison with expression of the protein of interest in the absence of an 
enhancer. However, while yield is increased over expression when enhancer is not 
present, there are no indications that either correct foldingls achieved or that full activity 
is obtained. 

Eur. Pat. Appl. No. EP 510,658 describes an improvement of the yield of secreted 
disulfide-bonded proteins in bacterial cell by providing a simultaneous expression of a 
recombinant vector encoding the prokaryotic protein disulfide isomerase of E. coli and 
the addition of thiol reagents to the culture medium to promote correct folding of the 
secreted polypeptide of interest. Unfortunately, the method produced negligible secreted 
protein unless sufficient thiol reagent was added to the culture medium, and if too much 
thiol reagent was present, cells were killed and the total protein isolated declined 
dramatically. 
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tPA is one example of a pharmaceutically-important drug produced by 
recombinant methods. Unfortunately the current methods for producing tPA from 
bacterial cell culture are both costly and laborious. One such method for the production 
of tPA in heterologous host organisms relies on the production of inactive tPA 
intracellularly in inclusion bodies, and the subsequent isolation and purification of such 
inclusion bodies, followed by activation of the tPA once freed from the inclusion bodies. 
U. S. Patent 5,077,392 discloses a renaturation method for refolding denatured proteins 
obtained after expression in inclusion bodies. tPA was isolated as a denatured reduced 
protein and on subsequent oxidation refolded under oxidizing conditions to obtain what 
was reported as up to a 26% yield of "reactivated" protein. While the method appeared to 
improve polypeptide yield, the process involves multiple, time-consuming steps, due to 
the initial recovery of the insoluble, inactive protein. 

Other methods of producing tPA have employed eukaryotic cell culture methods, 
which are also expensive and time-consuming. Mammalian cells have been used in 
attempts to improve production of highly active polypeptides such as tPA. U. S. Patent 
4,661,453 discloses production of tPA in substantial quantities in rat prostate 
adenocarcinoma cells. The tPA isolated from the cell culture medium shows tPA 
activity, however the method is quite expensive since the mammalian cells have an 
origin in spontaneous adenocarcinoma cancer cells, and must be selected for the ability to 
produce tPA. The method has not been shown to be feasible for commercial production 
of proteins such as tPA on an economic scale. Even methods involving the production of 
tPA in recombinant Chinese hamster ovary (CHO) cells, result in a cost-per-unit-dose of 
approximately $1200 in the current pharmaceutical market. 

1.2.4 Deficiencies in the Prior Art 

Currently there is a lack of efficient methods of producing complex eukaryotic 
proteins with multiple disulfide bonds on an economic scale. Likewise, there is a need to 
develop methods which produce proteins that are correctly folded and active without the 
need for reactivation or subsequent processing once isolated from a host cell. 

Therefore, what is lacking in the prior art are methods, recombinant vectors, host 
cells, and compositions comprising high-level expression of eukaryotic disulfide bond- 
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containing polypeptides (such as tPA and BPTI) which are soluble, correctly-folded, 
active, and readily isolatable from cell extracts of prokaryotic hosts. 

2. Summary of the Invention 

5 The present invention overcomes one or more of these and other drawbacks 

inherent in the prior art by providing novel methods, recombinant host cells, vectors and 
compositions resulting therefrom for efficiently producing eukaryotic polypeptides 
containing disulfide bonds in bacterial host cells which are active, correctly folded, and 
secreted from the bacterial cell to provide economic and convenient means for the 

10 recovery, isolation, and purification of the recombinant polypeptide of interest. The 

present invention represents a significant breakthrough in the fields of molecular biology, 
protein chemistry, and pharmaceutics, in producing eukaryotic recombinant polypeptides 
in prokaryotic hosts through novel methods and recombinant vectors which direct the 
coexpression of eukaryotic polypeptides of interest with a eukaryotic foldase such as 

15 protein disulfide isornerase. Recovery of correctly folded, active, soluble recombinant 

polypeptides in significant quantity is now possible by employing the disclosed methods 
and compositions. 

In one embodiment, the present invention provides a process for producing in a 
bacterial cell, a biologically -active, soluble eukaryotic polypeptide having at least about 

20 three disulfide bonds. The process generally involves expressing in the cell a first DNA 

segment encoding a disulfide isornerase operably linked to a signal sequence and a 
second DNA segment encoding a eukaryotic polypeptide operably linked to a signal 
sequence under conditions effective to produce the eukaryotic polypeptide. Preferably, 
the polypeptide is a mammalian polypeptide, with human and bovine polypeptides being 

25 particularly preferred. In important embodiments, the eukaryotic polypeptide is a tissue 

plasminogen activator or pancreatic trypsin inhibitor, and the disulfide isornerase is 
protein disulfide isornerase isolated from rat, yeast, or human origin. 

The eukaryotic polypeptide to be produced in the bacterial host will preferably 
comprise at least about three, four or five disulfide bonds. Or more preferably, about six, 

30 seven, eight, or nine disulfide bonds. Or still more preferably, at least about ten, eleven, 

twelve, thirteen, fourteen, fifteen, sixteen, or seventeen disulfide bonds. When the 

-6- 
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eukaryotic polypeptide is tPA, the peptide preferably comprises at least about seventeen 
disulfide bonds. 

The signal sequence is preferably selected from the group consisting of OmpA, 
LamB, StII, MalE, Lpp, and PelB, with OmpA and StII sequences be particularly 
5 preferred. Preferred promoters for the expression of the two DNA segments are selected 

from the group consisting of lac-lpp, lpp trc. ara, lac. tac, Tl P BAD , phoA and X P1 . with 
lac-lpp promoters being particular useful in the practice of the invention. Plasmids such 
as pLPPsOmpArPDI are preferred for the expression of the disulfide isomerasc, and 
plasmids such as pTPA177 or pACYCBPTI are particularly desirable for expression of 
10 the eukaryotic polypeptides of interest. 

The bacterial cell may be cultured in a medium comprising one or more reducing 
agents selected from the group consisting of glutathione, cysteine, cystamine, 
thioglycollate, dithiothreitol and dithioerythritol, and preferably, the bacterial cell is an 
Enterobacteriaceae cell such as Escherichia or Salmonella spp. cells. Highly preferred 
15 cells are E. coli ATCC XXXXX, SF103, SF110, UT5600 and RB7911 cells. ATCC 

XXXXX was deposited on , 1997 with the American Type Culture 

Collection, 12301 Parklawn Drive, Rockville, MD 20852 in accordance with U. S. Patent 
and Trademark Office requirements for microorganism deposits. The deposit has been 
made in accordance with the terms of the Budapest Treaty on the international 
recognition of the deposit of microorganisms for the purposes of patent procedure. In 
accordance with the terms of the Budapest Treaty: (a) the deposit will be made accessible 
to the Commissioner upon request during the pendency of this application; (b) all 
restrictions to access by the public of this deposit will be irrevocably removed upon 
granting of the patent; (c) this deposit will be maintained in the public depository for a 
25 period of thirty years or five years after the last request, or for the effective life of the 

patent, whichever is longer; and, (d) this deposit will be replaced if it should ever become 
non-viable. 

An important aspect of the invention is the production in bacterial cells of 
soluble, biologically-active eukaryotic polypeptides. In important embodiments, the 
30 soluble protein is secreted to the periplasm or to the outer membrane of the bacterial cell, 

and the polypeptide is isolatable from a culture supernatant or a soluble fraction of the 
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bacterial cell. Preferably, the eukaryotic polypeptide produced in the bacterial host 
assumes a conformation substantially identical to the conformation assumed by the 
polypeptide when produced in a eukaryotic host cell. Preferably, the eukaryotic 
polypeptide produced in the bacterial cell has a specific activity equal to or greater than 
the specific activity of the polypeptide when produced in a eukaryotic host cell. When 
the eukaryotic polypeptide is a tissue plasminogen activator prc^pein, a specific activity of 
at least about 5 to about 12 jj.g/l/OD 600nm of culture is obtained. This corresponds to a 
specific activity of approximately 2000 to 48000 IU/l/OD 600nm When the eukaryotic 
polypeptide is a pancreatic trypsininhibitor protein, a specific activity of about 10 jxg/mg 
of total cell protein is obtained. 

A further aspect of the invention is an expression system for producing in a 
bacterial cell, a biologically-active, soluble eukaryotic polypeptide. The expression 
system generally comprises a first DNA segment and a second DNA segment, with the 
first segment encoding a disulfide isomerase and the second segment encoding a 
eukaryotic polypeptide having at least about three disulfide bonds. As stated above, the 
protein produced is preferably a disulfide-bond containing polypeptide such as tissue 
plasminogen activator or pancreatic trypsin inhibitor, and the disulfide isomerase is 
preferably a rat, yeast, or human PDI. The DNA segments preferably further comprise a 
signal sequence such as OmpA, LamB, StII, MalE, Lpp, or PelB, and are expressed by a 
promoter such as lac-lpp, ara, lac y Ipp, trc, tac, T7, P B ad> phoA or A. PL . Preferred 
examples of the expression system include pLPPsOmpArPDI co-expressed in a bacterial 
cell with either pTPA177 or pACYCBPTI. The bacte»al cell may be cultured in a 
medium comprising one or more reducing agents selected from the group consisting of 
glutathione, cysteine, cystamine, thioglycollate, dithiothreitol and dithioerythritol, and 
preferably, the bacterial cell is an Enterobacteriaceae cell such as Escherichia or 
Salmonella spp. cells. Highly preferred cells are E. coli ATCC XXXXX, SF103, SF1 10, 
UT5600 and RB79 1 1 cells. 

Preferably the eukaryotic polypeptide expressed by this system has a specific 
activity of at least about 1 to about 1000 |xg/l/OD 60 o nm of culture, or more preferably, 
about 5 to about 500 f^g/l/OD^^ of culture, or more preferably, about 10 to about 100 
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ug/I/OD 600nni of" culture, with specific activities in the range of at least about 5 to about 

1 2 Hg/I/OD 600nm units of culture being highly preferred. 

The expression system may comprise a single recombinant vector which 
expresses both the disulfide isomerase-encoding DNA segment and the polypeptide of 
interest-encoding second DNA segment. Alternatively, the expression system may 
comprise two or more distinct plasmids one of which expresses the first DNA segment 
and a second of which expresses the second DNA segment In the case of the latter 
arrangement, it is preferably that both vectors are capable of replication and expression in 
a single bacterial cell. An example of such as system is a bacterial cell comprising the 
vector pLPPsOmpArPDI in combination with a second vector such as pACYCBPTl or 
pTPA177. 

A further aspect of the invention is a recombinant vector comprising a first 
transcriptional unit encoding a mammalian protein disulfide isomerase operably linked to 
a first signal sequence and a second transcriptional unit comprising a DNA segment 
encoding a mammalian polypeptide having at least about three disulfide bonds operably 
linked to a second signal sequence. 

As stated above, the protein produced is preferably a disul fide-bond containing 
polypeptide such as tissue plasminogen activator or pancreatic trypsin inhibitor, and the 
disulfide isomerase is preferably a rat, yeast, or human PDI. The DNA segments 
preferably further comprise a signal sequence such as OmpA, LamB, StII, MalE, Lpp, or 
PelB, and are expressed by a promoter such as lac-lpp, ara. lac, lpp, trc, tac, T7, P BAD) 
phoA or A. PL . Preferred examples of the recombinant vectors include pLPPsOmpArPDI, 
pTPA 1 77 and pACYCBPTl. The vectors are preferably introduced and maintained in a 
bacterial cell which may be cultured in a medium comprising one or more reducing 
agents selected from the group consisting of glutathione, cysteine, cystamine, 
thioglycollate, dithiothreitol and dithioerythritol, and preferably, the bacterial cell is an 
Enterobacteriaceae cell such as Escherichia or Salmonella spp. cells. Highly preferred 
cells are E coli ATCC XXXXX, SF103, SF110, UT5600 and RB791 1 cells. 

As such, a recombinant host cell transformed with an expression system or vector 
described above is a further embodiment of the invention. Preferred examples of the 
recombinant host cell include ATCC XXXXX, and SF103, SF110, UT5600 or RB791I 

-9- 
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cells transformed with the recombinant vector pLPPsOmpArPDl as well as either of 
plasmids pTPA177 or pACYCBPTL 

A further aspect of the invention is a composition comprising a biologically- 
active, soluble, recombinant tissue plasminogen activator protein or peptide operably 
linked to a bacterial export signal peptide. Preferably the tissue plasminogen activator is 
a mammalian tissue plasminogen activator such as human tPA^in particular aspects, the 
composition comprises a bacterial export signal peptide selected from the group 
consisting of OmpA, LamB, Stll, MalE, Lpp, and PelB. In preferred embodiments, the 
tPA is encoded by a DNA segment positioned under the control of a promoter selected 
from the group consisting of lac-lpp y lpp f trc, tac, T7, P^ad* P^°A and X PL . 

Preferably the tPA composition has a specific activity of at least about 1 to about 
1 000 fj.g/l/OD 600nm of culture, or more preferably, about 5 to about 500 Mg/l/OD 600nm of 
culture, or more preferably, about 10 to about 100 |^g/l/OD 600nni of culture, with specific 
activities in the range of at least about 5 to about 12 jag/l/OD^^ units of culture being 
highly preferred. 

A further aspect of the invention is a composition comprising a biologically- 
active, soluble, recombinant pancreatic trypsin inhibitor protein or peptide operably 
linked to a bacterial export signal peptide. Preferably the pancreatic trypsin inhibitor 
protein is a mammalian pancreatic trypsin inhibitor protein such as human or bovine PTI. 
In particular aspects, the composition comprises a bacterial export signal peptide selected 
from the group consisting of OmpA, LamB, Stll, MalE, Lpp, and PelB. In preferred 
embodiments, the PTI is encoded by a DNA segment positioned under the control of a 
promoter selected from the group consisting of lac-lpp y lpp, trc. iac y T7, P B ai>* phoA and 

Preferably the PTI composition has a specific activity of at least about 1 to about 
1000 jag/l/OD^^ of culture, or more preferably, about 5 to about 500 |ig/l/OD 600nm of 
culture, or more preferably, about 10 to about 100 fig/l/OD 600nm of culture, with specific 
activities in the range of at least about 5 to about 12 ng/l/OD 600nm units of culture being 
highly preferred. 

These and other embodiments of the invention are further understood in light of 
the teaching herein. 
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2.1 Methods for Producing Eukaryotic Polypeptides in Bacterial Cells 
The present invention discloses methods for producing recombinant multi- 
disulfide polypeptides such as PTI, tPA, antibody fragments, protease inhibitors, 
therapeutic enzymes, lymphokines, neurotrophic factors, and related polypeptides and 
derivatives, mutants, and fusion proteins derived therefrom. One of the problems with 
tPA isolated from natural sources is low yields and extensive]uirification processes. The 
present invention in an important embodiment illustrates a strategy for overproducing 
tPA from a bacterial host, employing DNA constructs encoding human tPA and rat PD1 
to transform Gram-negative bacterial cells and coexpress the two proteins to produce 
active, soluble, secreted recombinant tPA polypeptides in vivo. In addition to the 
production of the complete tPA molecule, derivatives of tPA lacking the finger-like 
region, one or both of the kringle sub-domains or the epidermal growth factor subdomain 
may also be expressed in functional form. Such mutants as well as mutated tPA 
molecules with amino acid substitutions that affect the proteolytic activity exhibit useful 
pharmacological and/or pharmacokinetic properties, and are all contemplated to fall 
within the scope of the present invention. 

An important aspect of the invention concerns methods for producing a 
biologically-active, recombinant eukaryotic polypeptide that contains multiple disulfide 
bonds. The method involves co-expressing in a suitable prokaryotic cell, such as a 
bacterial cell, a DNA segment encoding a prokaryotic signal sequence-eukaryotic 
disulfide isomerase fusion protein and a DNA segment_encoding a prokaryotic signal 
sequence-eukaryotic recombinant polypeptide fusion protein under suitable physiological 
conditions to produce the recombinant eukaryotic fusion protein of interest. It is 
contemplated that the fusion protein of interest may be any eukaryotic protein for which 
expression in a prokaryotic host is desirable, but in particular, eukaryotic proteins which 
contain two or more disulfide bonds, and preferably those which contain at least three or 
four disulfide bonds or more. 

Such preferred recombinant polypeptides include mammalian tissue plasminogen 
activator, mammalian pancreatic trypsin inhibitor, antibody fragments, insulin, protease 
inhibitors, therapeutic enzymes, lymphokines, cytokines, growth factors, neurotrophic 



WO 97/38123 



PCT/US97/05636 



factors and the like. The polypeptides may be native or mutated polypeptides, and 
preferred sources for such mammalian polypeptides include human, bovine, equine," 
porcine, lupine, and rodent sources, with human proteins being particularly preferred. 
The disulfide isomerase may be any such eukaryotic foldase which is capable of 
5 isomerizing disulfide bonds in a prokaryotic host. One such isomerase which is 

particularly preferred is mammalian protein disulfide isomeras^ Most preferably, rat, 
human, bovine or porcine protein disulfide isomerases are contemplated to be useful in 
the practice of the invention. 

The signal sequence employed in the practice of the invention may be any such 

10 sequence which encodes a signal capable of directing the export of the fusion protein to 

the bacterial periplasm or outer membrane, or alternatively into the culture supernatant in 
which such cells are grown. Typically, the signal sequence, or Leader peptide, may be 
any of those well-known to those of skill in the art to be capable of directing the export 
of proteins in vivo in bacterial cells. A particularly preferred sequence is the E. coli 

15 alkaline phosphatase OmpA signal sequence, but equally preferred signal sequences 

include the Lpp, LamB, MalE, PelB or StII signal sequences and the like. 

2.2 Expression Systems for Producing Eukaryotic Polypeptides in 
Bacteria 

In another preferred embodiment, the invention concerns an expression system 
that expresses both a disulfide isomerase such as one of the disulfide isomerases 
described herein, and an eukaryotic recombinant polypeptide of interest. The expression 
system has been used to produce recombinant OmpA-tPA and OmpA-BPTI when the 
OmpA-PDI polypeptide was coexpressed in the same recombinant host cells. The 
expression system is useful in the expression of recombinant polypeptides which have 
multiple disulfide bonds, even up to and including those with fourteen disulfide bonds. 

The expression system in a general sense is composed of two expression units: 
one containing a DNA segment which encodes the disulfide isomerase fusion protein, 
and a second unit containing a DNA segment which encodes the recombinant fusion 
protein of interest. As described above, the two expression units may either be contained 
on a single recombinant vector, or alternatively, may be contained on two separate and 
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distinct recombinant vectors. In the case of the latter, any two recombinant vectors may 
be utilized so long as the replicons are compatible in the same host cell and that the 
expression unit of each vector functions in the same cell to permit the co-expression of 
the two expression units. For example, the inventors contemplate the coexpression of 
pLPPsOmpArPDI and pACYCBPTI to be particularly useful in the production of BPTI 
from bacterial host cells, and the coexpression of pLPPsOmpArPDI and pTPA177 to be 
particularly useful in the production of tPA from bacterial host cells. 

Expression of the fusion proteins may be promoted by any of a number of 
suitable promoter sequences which are well-known to promote the transcription of genes 
and/or operons in bacterial cells. In preferred embodiments, the DNA segments of the 
present invention are expressed from a lac-lpp y tac, ara, lac, trc, PhoA P BAD , X PL , Ipp, or 
T7 promoter. The components of the expression system described herein may be located 
on separate recombinant vectors with each transcriptional unit under the control of its 
own promoter, or alternatively, the components of the expression system may be located 
within a single recombinant vector. In the latter case, the disulfide isomerase-encoding 
transcriptional unit may be controlled by one promoter, while the recombinant disulfide 
bond-containing polypeptide-encoding transcriptional unit may be controlled by a 
separate promoter, or alternatively, the two transcriptional units may be in the form of a 
"tandem" transcriptional unit with both being controlled by a single promoter located 5' 
of both coding regions. The inventors have found the lac-lpp promoter to be particularly 
useful in the practice of the present invention. 

Once a suitable (full length if desired) clone or clones have been obtained, 
whether they be cDNA based or genomic, one may proceed to prepare an expression 
system for the recombinant preparation of the eukaryotic polypeptides of the present 
invention. The engineering of DNA segment(s) for expression in a prokaryotic or 
eukaryotic system may be performed by techniques generally known to those of skill in 
recombinant expression. It is believed that expression in bacterial hosts, and E. coli in 
particular, will be preferred in the expression of high levels of correctly folded, active, 
soluble disulfide bond-containing eukaryotic polypeptides. 

The cDNAs for such foldases and disulfide-containing proteins of interest may be 
separately expressed in bacterial systems, with the encoded proteins being expressed as 
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fusions with p-galactosidase, ubiquitin, Schistosoma japonicum glutathione S- 
transferase, and the like. It is believed that bacterial expression using the methods 
described will ultimately have advantages over present prokaryotic expression systems 
which produce limited quantities of the proteins of interest intracellular^, inactive, in the 
form of inclusion bodies. This is particularly true for preparations of recombinant tPA. 

It is proposed that transformation of host cells with DN^ segments encoding the 
foldase (such as PDI) and the protein of interest will provide a convenient means for 
obtaining high levels of active secreted polypeptide. However, separate expression 
followed by reconstitution or reactivation of the protein once secreted is also certainly 
within the scope of the invention. For example, the inventors contemplate that the 
extracellular addition of thiol reagents such as glutathione will be useful in enhancing the 
recovery of certain proteins in large quantity using the methods described herein. Both 
cDNA and genomic sequences are suitable for expression, as the host cell will, of course, 
process the genomic transcripts to yield functional mRNA for translation into protein. 

For expression in this maimer, one would position the coding sequences adjacent 
to and under the control of the promoter. It is understood in the art that to bring a coding 
sequence under the control of such a promoter, one positions the 5' end of the 
transcription initiation site of the transcriptional reading frame of the protein between 
about 1 and about 50 nucleotides "downstream" of {i.e., 3' of) the chosen promoter. 

In accordance with the general guidelines described above, a preferred method for 
expressing human tPA DNA has been found to be the transformation of E. coli SF103 
cells with the expression vectors termed pTPAl 77 and pLPPsOmpArPDI. The pTPAl 77 
expression vector is constructed from pACYC184, and contains the OmpA leader-tPA 
gene fusion. 

Likewise, a preferred method for expressing bovine PTI DNA has been found to 
be the transformation of E. coli SF103 cells with the expression vectors termed 
pACYCBPTI and pLPPsOmpArPDL The pACYCBPTI expression vector is constructed 
from pACYC184, and contains the OmpA leader-BPTl gene fusion. 

pLPPsOmpArPDI contains the gene for the mature rat PDI fused to the OmpA 
signal sequence under the control of the Ipp-lac promoter. 
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Preferred expression systems for the production of recombinant proteins may be 
contained either on a single plasmid vector containing the isomerase and disulfide. 

It is contemplated that the recombinant polypeptides of the invention may be 
"overexpressed", i.e., expressed in increased levels relative to its natural expression in 
eukaryotic cells, or even relative to the expression of other proteins in the recombinant 
prokaryotic host cells. Such overexpression may be assessed by a variety of methods, 
including radio-labeling and/or protein purification. However, simple and direct 
methods are preferred, for example, those involving SDS/PAGE and protein staining or 
Western blotting, followed by quantitative analyses, such as densitometric scanning of 
the resultant gel or blot. A specific increase in the level of the recombinant protein or 
peptide in comparison to the level in native cells is indicative of overexpression, as is a 
relative abundance of the specific protein in relation to the other proteins produced by the 
host cell and, e.g., visible on a gel. 

As used herein, the term "engineered" or "recombinant" cell is intended to refer to 
a cell into which a recombinant gene, such as a gene encoding the foldase along with a 
gene encoding a disulfide bond-containing polypeptide of interest (e.g., tPA or BPTI) 
have been introduced. Therefore, engineered cells are distinguishable from naturally 
occurring cells which do not contain a recombinantly introduced gene. Engineered cells 
are thus cells having a gene or genes introduced through the hand of man. 
Recombinantly introduced genes will either be in the form of a cDNA gene (i.e., they 
will not contain introns), a copy of a genomic gene, or will include genes positioned 
adjacent to a promoter not naturally associated with the particular introduced gene. 

Generally speaking, it may be more convenient to employ as the recombinant 
gene a cDNA version of the gene. It is believed that the use of a cDNA version will 
provide advantages in that the size of the gene will generally be much smaller and more 
readily employed to transfect the targeted cell than will a genomic gene, which will 
typically be up to an order of magnitude larger than the cDNA gene. However, the 
inventors do not exclude the possibility of employing a genomic version of a particular 
gene where desired. 

Where the introduction of a recombinant version of one or more of the foregoing 
genes is required, it will be important to introduce the gene such that it is under the 
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control of a promoter that effectively directs the expression of the gene in the cell type 
chosen for engineering. In general, one will desire to employ a promoter that allows 
constitutive (constant) expression of the gene of interest. Commonly used constitutive 
promoters are generally viral in origin, and include the cytomegalovirus (CMV) 
5 promoter, the Rous sarcoma long-terminal repeat (LTR) sequence, and the SV40 early 

gene promoter. The use of these constitutive promoters will ensyre a high, constant level 
of expression of the introduced genes. The inventors have noticed that the level of 
expression from the introduced gene(s) of interest can vary in different clones, probably 
as a function of the particular recombinant gene construct used. Thus, the level of 
1 0 expression of a particular recombinant gene can be chosen by evaluating different clones 

derived from each transformation study; once that line is chosen, the constitutive 
promoter ensures that the desired level of expression is permanently maintained. 

23 Recombinant Vectors for Expressing Eukaryotic Polypeptides in 

15 Bacteria 

In a related embodiment, the invention discloses a recombinant vector comprising 
a first transcriptional unit encoding a mammalian protein disulfide isomerase operatively 
linked to a signal sequence and a second transcriptional unit comprising a DNA segment 
encoding a mammalian tissue plasminogen activator or a mammalian pancreatic trypsin 

20 inhibitor. 

A preferred plasmid for the expression of a protein disulfide isomerase 
transcriptional unit is pLPPsOmpArPDI. * 

A preferred plasmid for cloning the eukaryotic "target" polypeptide-encoding 
DNA fragment is pACYCl 84, although any other vector which may be maintained in the 
25 bacterial host and is compatible with the PDI -encoding expression vector may also be 

used. In particular, pACYC184 was used to create pACYCBPTI and pTPA177 which 
contain DNA sequences encoding bovine pancreatic trypsin inhibitor and human tissue 
plasminogen activator, respectively. 
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2.4 Recombinant Host Cells 

In particular, the invention provides recombinant Gram-negative host cells, 
preferably E. coli or Salmonella spp., transformed with nucleic acid segments encoding 
eukaryotic disulfide bond-containing polypeptides and an eukaryotic foldasc from which 
5 the correctly-folded, active disulfide bond-containing polypeptide may be isolated. In 

sharp contrast to native un-engineered host cells, these transformed host cells have the 
ability to catalyze the formation and isomcrization of disulfide bonds in eukaryotic 
proteins. Wild-type unengineered bacteria cannot normally form the correct folded 
structure of these proteins due to an inability to isomerize disulfide bonds in eukaryotic 
1 0 proteins. 

The foldase is preferably a disulfide isomerase, and more preferably a protein 
disulfide isomerase. Such PDIs may be isolated from mammalian cells, plant cells, 
mycelial fungi, or yeast cells. Eukaryotic foldases such as the yeast Eugl and its 
homologs are also contemplated to be useful in the practice of the present invention. 
15 Particularly preferred eukaryotic PDIs are obtained from mammalian or yeast sources. 

Exemplary mammalian sources for the foldases include human, bovine, rodent, porcine, 
equine, and lupine mammals. Exemplary yeast sources for the foldases include 
Saccharomyces spp. Pichia spp. (in particular Pichia pastoris) and Candida spp. 

Prokaryotic hosts are preferred for expression of the proteins of the present 
invention. Some examples of prokaryotic hosts are E. coli strain SF103, RR1, LE392, B, 
X 1776 (ATCC 31537) as well as E. coli W3110 (F\ X, prototrophic, ATCC 273325). 
Enterobacteriaceae species such as Salmonella typhimurium and Serratia marcescens 
and various Pseudomonas species may also be used. 

In general, plasmid vectors containing replicon and control sequences which are 
25 derived from species compatible with the host cell are used in connection with these 

hosts. The vector ordinarily carries a replication site, as well as marking sequences 
which are capable of providing phenotypic selection in transformed cells. For example, 
E coli is typically transformed using pBR322, a plasmid derived from an E. coli species 
(Bolivar et al., 1977), or pACYC184 as described above. pBR322 contains genes for 
ampicillin (Amp) and tetracycline (Tet) resistance and thus provides easy means for 
identifying transformed cells. pBR322, its derivatives, or other microbial plasmids or 
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bacteriophage may also contain, or be modified to contain, promoters which can be used 
by the microbial organism for expression of endogenous proteins. 

In addition, phage vectors containing replicon and control sequences that are 
compatible with the host microorganism can be used as transforming vectors in 
5 connection with these hosts. For example, bacteriophage such as A.GEM™-11 may be 

utilized in making a recombinant vector which can be used tatr^isform susceptible host 
cells such as E. coli LE392. 

Those promoters most commonly used in recombinant DNA construction include 
the P-lactamase (penicillinase) and lactose promoter systems (Chang et al. y 1978; Itakura 
10 et aL y 1977; Gocddel et al., 1979) or the tryptophan (trp) promoter system (Goeddel et 

al. 9 1980). While these are the most commonly used, other microbial promoters have 
been discovered and utilized, and details concerning their nucleotide sequences have 
been published, enabling a skilled worker to ligate them functionally with plasmid 
vectors. 

15 In addition to prokaryotes, eukaryotic microbes, such as yeast cultures may also 

be employed in various aspects of the present invention. Saccharomyces cerevisiae, or 
common baker's yeast is the most commonly used among eukaryotic microorganisms, 
although a number of other strains are commonly available. For expression in 
Saccharomyces , the plasmid YRp7, for example, is commonly used (Stinchcomb et 

20 1979; Kingsman et aL 9 1979; Tschemper et al, 1980). This plasmid already contains the 

trpL gene which provides a selection marker for a mutant strain of yeast lacking the 
ability to grow in tryptophan, for example ATCC 44076 or PEP4-1 (Jones, 1977). The 
presence of the trpL lesion as a characteristic of the yeast host cell genome then provides 
an effective environment for detecting transformation by growth in the absence of 

25 tryptophan. 

Suitable promoting sequences in yeast vectors include the promoters for 3- 
phosphoglycerate kinase (Hitzeman et al. y 1980) or other glycolytic enzymes (Hess et al. y 
1968; Holland el al. y 1978), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, 
hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphatc 

30 isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, 

phosphoglucose isomerase, and glucokinase. In constructing suitable expression 
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plasmids. the termination sequences associated with these genes are also ligated into the 
expression vector 3' of the sequence desired to be expressed to provide polyadenylation 
of the mRNA and termination. Other promoters, which have the additional advantage of 
transcription controlled by growth conditions are the promoter region for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated 
with nitrogen metabolism, and the aforementioned j^ceraldehyde-3-phosphatc 
dehydrogenase, and enzymes responsible for maltose and ^lactose utilization. Any 
plasmid vector containing a yeast-compatible promoter, an origin of replication, and 
termination sequences is suitable. 



2.5 Isolation of Soluble Eukaryotic Polypeptides from Bacterial Cells 
Another aspect of the invention concerns the isolation of biologically-active 
recombinant eukaryotic disulfide bond-containing polypeptides from the soluble fraction 
of bacterial cells. 

The inventors have demonstrated that methods described herein may result in 
secretion of the recombinant fusion polypeptides to the bacterial periplasmic space. It is 
also contemplated that particular gene constructs may be utilized which alternatively 
direct the export of the fusion proteins of interest to either the outer membrane or even 
result in the secretion of the fusion proteins to the culture supernatant, from which the 
particular polypeptides may be isolated using conventional techniques for the isolation 
and purification of proteins. Particularly preferred cells for use in the practice of the 
invention include Gram-negative species, and in particular, members of the 
Enterobacteriaceae, with E. coli and Salmonella spp. cells being particularly preferred. 
Most preferred strains for use in the practice of the invention include E. coli strains such 
25 as SF103, SF110, UT5600, or RB791 (as disclosed in U. S. Patent 5,508,192, 

specifically incorporated herein by reference). 

The invention also provides for compositions comprising a biologically-active, 
tissue plasminogen activator operatively linked to a bacterial export signal peptide, and 
compositions comprising a biologically-active, pancreatic trypsin inhibitor operatively 
30 linked to a bacterial export signal peptide. 
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Using the methods disclosed herein, the inventors have developed novel 
compositions comprising a soluble biologically-active tissue plasminogen activator 
operatively linked to a bacterial export signal peptide which has a specific activity of 5- 
30 |ig/L/OD 600 unit of cell culture. The tPA composition obtained was isolatable from 
the bacterial periplasm. 



2.6 Purification of Eukaryotic Proteins from the Bacterial Periplasm 

Because wild-type prokaryotic hosts lack the appropriate enzymes to perform 
isomerization, and because native bacteria cannot secrete properly folded and active 
forms of eukaryotic polypeptides, a significant limitation in the purification of valuable 
recombinant proteins has existed. The present invention, however, overcomes 
limitations in the art by providing recombinant host cells which produce correctly-folded, 
biologically-active eukaryotic proteins in soluble, secreted form. 

The recombinant proteins of the present invention may contain multiple disulfide 
bonds. Particularly preferred are proteins which contain at least three disulfide bonds or 
more. More particularly, preferred proteins include those eukaryotic proteins which 
contain at least about five or more, or even twelve or more, or most preferably even 
about fourteen to about seventeen or more disulfide bonds. The inventors have 
demonstrated success with the method expressing proteins having fewer than four 
disulfide bonds (such as mammalian pancreatic trypsin inhibitor), and surprisingly have 
demonstrated success with proteins having fourteen or more disulfide bonds, such as 
mammalian tPA. * 

Surprisingly, the inventors have determined that the correct formation of disulfide 
bonds in proteins of interest can be mediated by engineering prokaryotic host cells to 
express an eukaryotic foldase (and in particular disulfide isomerases) in conjunction with 
the specific eukaryotic protein of interest which contain disulfide bonds. This co- 
expression of a foldase enzyme and the disulfide bond-containing peptide of interest now 
permits the efficient production of active peptides in recombinant bacterial host cells. 

By expressing eukaryotic foldases, and particularly disulfide isomerases, in 
bacteria concomitantly with nucleic acid segments encoding particular recombinant 
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proteins of interest, yields of the recombinant eukaryotic proteins by prokaryotic hosts 
have been remarkably increased. 

In preferred embodiments, the novel methods disclosed herein have employed 
bacterial cells such as E. coli to produce high yields of multi-disulfide bond-containing 
eukaryotic enzymes. For example, the inventors have succeeded in producing significant 
quantities of active, correctly folded tPA in a bacterial cell. TJs protein has 14 disulfide 
bonds that must form correctly in order for the protein to be active. The invention is the 
first demonstration of production of significant quantities of tPA from bacterial host cells 
which is not associated with insoluble intracellular inclusion bodies. Likewise, the 
invention has been used to facilitate the production of another commercially important 
enzyme, pancreatic trypsin inhibitor, and in particular, bovine PTI, using bacterial hosts 
which co-express an eukaryotic foldase such as PDI. The invention represents a 
breakthrough in the production of commercial quantities of such multi-disulfide bond- 
containing proteins of economic interest by providing rapid, inexpensive methodologies 
for secretion of such proteins by the engineered bacterial cells. 

In another preferred embodiment, the invention has demonstrated that 
recombinant host cells devoid of intrinsic prokaryotic PDI-like activity may be 
successfully complemented using the eukaryotic foldase-encoding nucleic acid 
compositions disclosed herein. The inventors have demonstrated that both rat and yeast- 
derived PDI-encoding DNA segments may be used to provide disulfide bond isomerizing 
activity to bacterial strains devoid of such activity. 

Further aspects of the present invention concern the purification, and in particular 
embodiments, the substantial purification, of a disulfide bond-containing polypeptide, 
and in particular a purified tPA or purified PTI protein composition. The term "purified 
tPA" as used herein, is intended to refer to a tPA composition, isolatable from 
recombinant bacterial host cells, wherein the tPA is purified to any degree relative to its 
naturally-obtainable state, Le. 9 in this case, relative to its purity within a cell extract. A 
purified tPA composition therefore also refers to a tPA polypeptide, free from the 
environment in which it may naturally occur or from the recombinant host cell in which 
it was produced. Likewise, the term "purified PTI" as used herein, is intended to refer to 
a PTI composition, isolatable from recombinant bacterial host cells, wherein the PTI is 
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purified to any degree relative to its naturally-obtainable state, i.e., in this case, relative to 
its purity within a cell extract. A purified PTJ composition therefore also refers to a PTI 
polypeptide, free from the environment in which it may naturally occur or from the 
recombinant host cell in which it was produced. 
5 Generally, "purified" will refer to a tPA or PTI polypeptide composition which 

has been subjected to fractionation to remove various recembin^ht host cell components, 
and which composition substantially retains its tPA or PTI activity. Where the term 
"substantially purified" is used, this will refer to a composition in which the protein of 
interest forms the major component of the composition, such as constituting about 25%, 

10 about 50%, or even about 75% or greater of the soluble proteins isolated from the 

periplasm of the recombinant host cells described herein. 

Various methods for quantifying the degree of purification of such peptides will 
be known to those of skill in the art in light of the present disclosure. These include, for 
example, determining the specific activity of an active fraction, or assessing the number 

15 of polypeptides within a fraction by SDS/PAGE analysis. For example, a preferred 

method for assessing the purity of a tPA composition is to calculate the specific activity 
of the fraction containing the tPA composition, to compare it to the specific activity of 
the initial soluble protein extract, and to thus calculate the degree of purity, herein 
assessed by a "-fold purification number". 

20 As is generally known in the art, to determine the specific activity, one would 

calculate the number of units of activity per milligram of total protein. In the purification 
procedure, the specific activity of the starting material, ne. t of the soluble periplasmic 
extract, would represent the specific activity of the protein of interest in its un-purified 
state. At each step in the purification and concentration of the protein of interest, one 

25 would generally expect the specific activity of the particular enzyme to increase above 

this value, as it is purified relative to its un-purified state. In preferred embodiments, it is 
contemplated that one would assess the degree of purity of a given periplasmic fraction 
comprising recombinant tPA or PTI by comparing its specific activity to the specific 
activity of the starting material, and representing this as x-fold purification. The use of 

30 "fold purification" is advantageous as the purity of an inhibitory fraction can thus be 
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compared to another despite any differences which may exist in the actual units of 
activity or specific activity. 

Generally, "purified" will refer to a protein or polypeptide composition which has 
been subjected to fractionation to remove various non-peptidc components such as other 
5 cell components. Various techniques suitable for use in protein purification will be well 

known to those of skill in the art. These include, for example, precipitation with 
ammonium sulfate, PEG, antibodies and the like or by heat^enaturation, followed by 
centrifugation; chromatography steps such as ion exchange, gel filtration, reverse phase, 
hydroxylapatite and affinity chromatography; isoelectric focusing; gel electrophoresis; 
1 0 and combinations of such and other techniques. 

As mentioned above, although preferred for use in certain embodiments, there is 
no general requirement that the recombinant polypeptides always be provided in their 
most purified state. Indeed, it is contemplated that less substantially purified 
polypeptides, which are nonetheless enriched in activity relative to the natural state, will 
have utility in certain embodiments. Partially purified disulfide bond-containing 
recombinant polypeptide fractions for use in such embodiments may be obtained by 
subjecting a recombinant host cell periplasmic fraction to one or a combination of the 
purification steps commonly used for their purification from soluble fractions as 
described above. 
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2.7 Nucleic Acid Segments Encoding Eukaryotic Fusion Polypeptides 

The process of selecting and preparing a nucleic acid segment which includes the 
preferred nucleic acid sequences encoding the peptides of interest is well-known to those 
of skill in the art. This may alternatively be described as preparing a nucleic acid 
fragment, or cloning a specific gene. Of course, fragments may also be obtained by other 
techniques such as, e.g., by mechanical shearing or by restriction enzyme digestion. 
Small nucleic acid segments or fragments may be readily prepared by, for example, 
directly synthesizing the fragment by chemical means, as is commonly practiced using an 
automated oligonucleotide synthesizer. Also, fragments may be obtained by application 
of nucleic acid reproduction technology, such as the PCR™ technology of U. S. Patents 
4,683,195 and 4,683,202 (incorporated herein by reference), by introducing selected 
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sequences into recombinant vectors for recombinant production, and by other 
recombinant DNA techniques generally known to those of skill in the art of molecular" 
biology. 

5 3. Brief Description of the Drawings 

The drawings form part of the present specification an^ are included to further 
demonstrate certain aspects of the present invention. The invention may be better 
understood by reference to one or more of these drawings in combination with the 
detailed description of specific embodiments presented herein. 

10 FIG. 1. Expression of rPDI in a dsbA mutant. Bacterial cultures JCB570 

(dshA*) and JCB571 {dsbA') were grown overnight at 37°C and diluted 100 fold into 
fresh media. After 30 min, the cultures were divided in two equal parts, one of which 
received IPTG at a final concentration of 0.5 mM. Cells were harvested at OD 600nm = 0.4 
and fractionated by osmotic shock, electrophoresis on the osmotic shock supernatant 

15 was carried out on 12.5% acrylamide gels under reducing conditions. The rPDIf 

fragment, which represents a C-terminal fragment of rPDI including the second active 
site, arises due to an internal translation initiation within rpdi (Dc Sutter et al. % 1994). 

FIG. 2A. rPDI rescues phenotypes of dsbA mutants. F-pilus assembly. 
Uninduced cells were either infected with the filamentous phage JB4 (Cm r M13) or used 

20 as the donor for conjugation with SF103 as the recipient. Strains: w.t. = JCB502F'; 

dsbA' = JCB572 F'; Plasmids: rPDI = pLPPsOmpArPDI; control = pTI103. Data 
represent the average from three independent studies. 

FIG. 2B. rPDI rescues phenotypes of dsbA mutants. Growth in M9 minimal 
media. Solid symbols: JCB570 (dsbA\ open symbols: JCB571 (dsbA ); (closed circle, 

25 open circle) = no plasmid, (closed triangle, open triangle,A) = pLPPsOmpArPDI, (£>,y) = 

pTl 1 03 (control plasmid). 

FIG. 3A. Shown is the rPDI rescue of disulfide formation in dsbA' cells. 
Oxidation of alkaline phosphatase in dsbA' and dsbB' cells with or without rPDI. 
Bacterial cultures JCB570 (dsbA* dsbB*\ JCB571 (dsbA' dsbB') JCB789 (dsbA* dsbB ) 

30 and JCB758 (dsbA' dsbB') were labeled with 35 S-TransLabel for 45 sec. followed by a 10 

min chase with excess methionine and cysteine. Proteins were precipitated with 10% 
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trichloroacetic acid, treated with iodoacetamide and immunoprccipitated with antisera 
against alkaline phosphatase. Immune complexes were resolved by electrophoresis 
under non-reducing or reducing conditions and visualized by autoradiography. The 
positions of reduced (red-AP), oxidized (ox-AP) and precursor (pre-AP) are indicated. 
5 FIG - 3B - Shown are the effects of rPDI on disulfide formation in dshB' 

cells. Oxidation of alkaline phosphatase in dsbA' and dsbB" cells with or without rPDI. 
Bacterial cultures JCB570 {dsbA" dsbB 1 ), JCB571 {dsbA' ds^B*) JCB789 {dsbA' dsbB ) 
and JCB758 (dsbA' dsbB ) were labeled with 3S S-TransLabel for 45 sec. followed by a 10 
min chase with excess methionine and cysteine. Proteins were precipitated with 10% 
10 trichloroacetic acid, treated with iodoacetamide and immunoprecipitated with antisera 

against alkaline phosphatase. Immune complexes were resolved by electrophoresis 
under non-reducing or reducing conditions and visualized by autoradiography. The 
positions of reduced (red-AP), oxidized (ox-AP) and precursor (pre-AP) are indicated. 

FIG. 4. Coexpression of rPDI improves the yield of BPTI as detected by 

15 ELISA. Bacterial cultures JCB570 containing pACYCBPTI (open bars) or 

pLPPsOmpArPDI and pACYCBPTI (solid bars) were induced with 0.1 mM IPTG at 
OEWnn, = 0.3-0.35. GSH and/or GSSG as indicated was added 20 min after induction. 
Five hours after induction the cells were harvested and the concentration of BPTI in the 
soluble fraction was detected by ELISA. The data reported here represent the average of 
three independent studies; for each sample ELISAs were performed at least in triplicate. 

FIG.5A. Western blot showing expression of BPTI with and without 
coexpression of rPDI. Bacterial cultures JCB570 (wild type), JCB571 {dsbA ) and 
JCB789 (^fi>arrying pACYCBPTI were induced (except those marked with a "U") 
with 0.1 mM IPTG at OD 600nm = 0.3-0.35. GSH and GSSG at the indicated mM 
25 concentrations were added 20 min after induction. Five hours after induction the cells 

were harvested and the soluble fraction affinity precipitated with trypsin-agarose beads, 
electrophoresed through 16% SDS-PAGE Tricine gels and detected by Western blot. 
Shown are wild-type cells. Lane 1-2 soluble fraction of non-plasmid containing wild 
type cells with (X) and without (Y) an addition of 2 ug BPTI. 
30 FIG.5B. Western blot showing expression of BPTI with and without 

coexpression of rPDI. Bacterial cultures JCB570 (wild type), JCB571 {dsbA') and 
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JCB789 (<&Z>5")carrying pACYCBPTI were induced (except those marked with a "U") 
with 0.1 mM IPTG at OD 600 nm = 03-0.35. GSH and GSSG at the indicated mlvT 
concentrations were added 20 min after induction. Five hours after induction the cells 
were harvested and the soluble fraction affinity precipitated with trypsin-agarose beads, 
5 electrophoresed through 16% SDS-PAGE Tricine gels and detected by Western blot. 

Shown are Dsb mutant cells. Symbols: U = uninduced cells; [^p = preOmpABPTI; # = 
degradation product of preOmpArPDl; rPDl = carrying plasmid pLPPsOmpArPDI. 

FIG. 6. Accumulation of two disulfide folding intermediates of BPTI in E. 

coli coexpressing rPDI as monitored by non-reducing electrophoresis. Proteins 
10 immunoprecipitated from cultures labeled with L-[ 35 S]cysteine for 1 min and quenched 

with 100 mM iodoacetamide were resolved on non-reducing gels containing 8 M urea. 
Times indicated are min after the chase. Lanes 1-4, no addition of glutathione. Lanes 
5-8, 2 mM GSH added 30 min before pulse. The positions of preOmpA-BPTI (Pre), 
reduced carboxymethylated BPTI (R), native BPTI (N) and two disulfide intermediates 
15 (*) are indicated. 

FIG. 7. Restriction maps of the vectors pACYCBPTI and ptPAl 77. 

FIG. 8. PhoA activity in R189 and R190 cells containing either no 

plasmid or pYPDl. 

20 4. Description of I llustrati ve Embodiments 

4.1 Protein Folding In Vivo 

Extensive studies of the physicochemical aspects of the protein folding problem 
over the last 40 years have shed light on the nature of the rate limiting steps during the 
formation of the native conformation. Rate limiting steps in folding include non- 
25 covalent processes, such as the alignment of protein subdomains, subunit assembly etc., 
and covalent processes, for example disulfide bond formation, peptidylproline 
isornerization, proprotein processing and others. Jn-vivo y the rate of the covalent steps is 
catalyzed by a host of cellular enzymes. On the other hand, partially folded species that 
accumulate because of slow conformational changes are protected from non-productive 
30 interactions by the action of chaperones (Gething and Sambrook, 1992). The 
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reductionist approach of analyzing in vitro each of these steps separately has elucidated 
many mechanistic aspects of the folding problem. However, the folding of proteins- in 
nature occurs in the concentrated milieu of the cellular environment and is probably 
coupled to biosynthesis and/or export. These additional factors together with the 
multitude of processes that affect folding in the first place, make it practically impossible 
to understand how folding proceeds in the cell solely from in vitro data. 

The regulation of protein folding is essential for the^ormal function of the cell 
and its response to different environments. In addition, protein folding underlies the 
cause of many human disorders (Wetzel, 1994). A detailed analysis of the in vivo 
folding pathway can shed light on the influence of various cellular parameters and the 
nature of rate limiting steps in the folding of proteins in the cell. Such information will 
ultimately elucidate the molecular basis of this major biological mechanism and could 
have considerable medical implications. 

At present, the protein for which the most detailed information on the kinetics 
and energetics of folding and on the structure of all the key intermediates is available, is 
BPTI. BPTI can be secreted to the periplasmic space of E. coli where it folds to the 
native state. Even though BPTI is a heterologous protein, it interacts with bacterial 
components and its folding is absolutely dependent on the function of E. coli 
oxidoreductases (Ostermeier and Georgiou, 1 994). 

4.2 Protein Folding In Vitro 

The refolding of proteins from denaturant solutions is a spontaneous process 
directed by the amino acid sequence and the solvent conditions (Matthews 1993; Fersht, 
1993). However, even though folding is thermodynamically favored, the yield of the 
native protein upon refolding in vitro can range from almost 0 to 1 00%, and the time 
required for renaturation from milliseconds to days. The reason for such differences 
relates to the kinetics of the folding process. Exciting recent studies by Fersht and 
coworkers and others (Otzen et al. 1994; Kuszewski et al. 1994) suggest that the 
folding of protein subdomains proceeds according to the global collapse model (Dill et 
al, 1993), via a transition state resembling an expanded form of the native conformation 
but with no fully formed elements of secondary or tertiary structure. However, for more 
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complex proteins other processes such as the alignment of subdomains or covalcnt 
changes introduce rate limiting steps. Covalent changes that limit folding include: cis- 
trans isomerization of peptidylproline bonds, formation of disulfide bonds, proteolytic 
processing of proproteins, heme ligation, etc. In the cell most, if not all of these rate 
5 limiting processes are facilitated by accessory proteins known as foldases and chaperones 

(Gething and Sambrook, 1992). Foldases have a clearly- defined catalytic activity 
whereas chaperones perform multiple functions, the most important of which is 
providing an environment for nascent proteins to fold without the competing process of 
self-association. The distinction is somewhat artificial because some proteins like 

1 0 protein disulfide isomerase function both as foldases and chaperones, at least in vitro. 

The presence of accessory proteins underlies one of the fundamental differences 
between refolding studies and the way in which folding proceeds in the cell. In addition, 
in vitro studies are conducted with highly purified polypeptides that are first unfolded 
and then allowed to relax to their native conformation in dilute solutions. In contrast, the 

15 folding of proteins in the cell is probably coupled to protein synthesis, takes place in a 

much more concentrated environment and is affected by compartmentalization. To 
complicate matters even more, the growth conditions affect folding directly, by 
influencing the rates of conformational and covalent changes, and indirectly, by 
modulating the expression and activity of accessory proteins. 

20 A central question in protein folding is to what extent in vitro studies reflect the 

physiological folding pathway. In the cell, newly synthesized polypeptides can interact 
with a variety of chaperones and foldases and, quite possibly, with other cellular 
components such as membranes, Iigand or substrate molecules, and low-molecular- 
weight solutes. Furthermore, because of the vectorial nature of ribosomal synthesis, and 

25 of the export apparatus in the case of secreted proteins, the initial state from which 

folding commences is non-random. 

The denatured state of the protein can exert a significant influence on the folding 
pathway. Unfortunately, determining the folding pathway in vivo is not a straightforward 
matter because of the paucity of methods for isolating partially folded proteins. Progress 

30 in the elucidation of in vivo folding processes has been possible only for proteins which 

form exceedingly stable intermediates, such as the P22 endoramnosidase (Goldenberg 
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and King, 1982; Mitraki et al., 1991) and for those where partially folded molecules can 
be trapped by blocking the formation of disulfide bonds. Studies by the groups of 
Helenius and Ruddon have employed chemical modification of free thiols to quench 
folding and dissect the pathway of disulfide bond formation in influenza hemagglutin 
5 and human chorionic gonadotropin, respectively (Braakman et al., 1992; Bedows et al., 

1992). However, only for the latter protein is the in vitro folding pathway sufficiently 
well characterized to allow a direct comparison with results ^ in vivo studies (Huth et 
al., 1993). Furthermore, the folding of H. influenza hemagglutin and human 
gonadotropin has been studied in mammalian cells in which the use of genetic techniques 
1 0 for dissecting the role of cellular factors is technically difficult. 

The periplasmic space is the cellular compartment defined by the inner and outer 
membranes of Gram-negative bacteria (Pugsley, 1993). Unlike the cytoplasm, the 
periplasmic space is maintained under strongly oxidizing conditions, thus facilitating the 
formation of disulfide bonds in secreted proteins (Walker and Gilbert, 1 995; Wulfing and 
15 PlOckthun, 1994). The majority of periplasmic proteins are exported across the 

cytoplasmic membrane via the general secretion pathway (Pugsley, 1993) whose 
components include the proteins SecA, SecY, SecE, SecD, SecF and the recently 
discovered SecG (Nishiyama et al., 1994). Newly translocated polypeptides are often 
transiently associated with the external face of the cytoplasmic membrane before they are 
released into the periplasm. There is evidence that- this transient association with the 
membrane may be important for protein folding (Matsuyama et al., 1993). 

The bacterial periplasmic space is topological^ equivalent to the endoplasmic 
reticulum (ER) of eukaryotic cells and, just like in the ER, folding is modulated by the 
action of several proteins. In addition, the periplasmic PapD chaperone is essential for 
the assembly of type I pili (Huitgren et al, 1993) and it has been proposed that ClpB 
may also function as a chaperone (Wulfing and Pluckthun, 1994). An important 
difference between periplasmic and cytoplasmic chaperones is that, unlike GroEL and 
DnaK, the binding and release of polypeptide substrates from periplasmic chaperones 
cannot involve ATP binding/hydrolysis since there is no evidence for a high energy 
30 phosphate donor in the periplasm. 
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A protein with cis-trans proline isomerase activity (rotamase) has been isolated 
from E. coli periplasm ic fractions and has been shown to be active in a protein refolding 
assay (Wulfing and Pluckthun, 1994). However, the in vivo function of this protein has 
not yet been elucidated. 

The formation of disulfide bonds is catalyzed by the multicomponent dsb system. 
The dsb genes have been identified by genetic analysis, So^pr three proteins. DsbA, 
DsbB and DsbC have been characterized in some detail (Bardwell, 1994, Shevchik et ai. 
1994, Missiakas et oi, 1994). Three additional genes that confer resistance to DTT have 
been isolated and named dsbD, dsbE and dshF. Null mutations in dsbD, dshE and dsbF 
are known to affect the formation of disulfide bonds in native proteins but little other 
information is currently available. 

43 BPTI 

One of the best characterized folding pathways in vitro is that of BPTI. The 
folding pathway has been elucidated through the isolation and characterization of the 
one- and two-disulfide intermediates that occur during folding (Creighton and 
Goldenberg, 1984; Weissman and Kim, 1991; Goldenberg, 1992). The main steps in this 
pathway are shown below: 

R I II * NSH.SH N 

\ / - 

N* 

Scheme 1 

where R is the fully reduced protein, 1 represents the various one-disulfide 
species, II represents the native like intermediate N'(30-51, 14-38) as well as 
two-disulfide species with one non-native and one native disulfide, N* represents the 
kinetic trap (5-55, 14-38), N sasH is the native-like species (3051, 5-55) and N is the 
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native protein with three disulfides. The rale-limiting step in folding is the conversion of 
II to N' MM ' which is then rapidly oxidized to the native protein. The structure of all 
major doling intermediates has been extensively characterized using 'H and l5 N NMR 
(Van Mierlo et a/., 1993). N*, and to a lesser extent N\ have a native-like conformation 
and therefore the activation energy for the conformational change required to position the 
two thiols in close proximity is high and thus the rate of rearrangement to N SH SH is slow. 
The unpaired cysteines of N* are buried within the interior ofS e protein. Consequently, 
N* is exceedingly slow to rearrange to other species and is described as a kinetic trap in 
the folding pathway (Creighton and Goldenberg, 1984). 

4.4 EUKARYOTIC PDI 

PDI is an endoplasmic reticulum enzyme that catalyzes cysteine oxidation and 
disuJfide bond rearrangement and is essential for cell viability in Saccharomyces 
cerevisiae (Freedman, 1989; Novia et a/., 1991). In vitro, PDI has been shown to 
catalyze all steps in the BPTI folding pathway (Creighton et ai, 1980) including the 
rearrangement of N' and N* to the labile intermediate N SHSH under conditions that 
resemble those of the endoplasmic reticulum (Weissman and Kim, 1993). The folding of 
BPTI is accelerated by the presence of total protein from the endoplasmic reticulum, a 
phenomenon which has been shown to be totally accounted for by the activity of protein 
disulfide isomerase (Zapun et al, 1992). Creighton and co-workers (1993) studied the 
folding of proBPTI produced by an in vitro translation system and imported into 
microsomes. proBPTI consists of the mature protein witR a 13-residue extension on the 
N-terminal and 7 residue extension at the C-terminal. Its folding was found to be 
dependent on the presence of GSSG. The formation of native protein was found to be 
substantially higher than in vitro with complete folding occurring within 1 min under 
strongly oxidizing conditions (10 mM GSSG) and within 2 min with 4 mM GSSG. It 
was postulated that the higher rate of folding in microsomes was due to PDI which was 
shown to catalyze both disulfide bond formation and rearrangement in proBPTI 
(Creighton et at. , 1 993). 

The periplasmic space of E. coli is topological^ equivalent to the endoplasmic 
reticulum. It is maintained at a redox state favoring the formation of disulfide bonds, a 
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process which is accelerated by at least two interacting proteins, DsbA and DsbB 
(Bardwell and Beckwith, 1993; Bardwell et al. n 1993). The three-dimensional structure 
of DsbA was solved recently and shown to consist of a thioredoxin-like domain joined to 
a second domain which may be responsible for substrate specificity (Martin et al, 1993). 
5 Similarly, PDI has also been predicted to contain thioredoxin-like domains. Given the 

analogies between the endoplasmic reticulum and the bacterialjperiplasmic space, it is 
not surprising that native BPTI can form in E. coli provided it is expressed with a 
bacterial leader peptide (Marks et al y 1986; GoldenbeTg, 1988). 

Eukaryotic PDI is a 55 kDa enzyme with cysteine oxidoreductase, chaperon and 
10 antichaperon activities that catalyzes disulfide formation and rearrangement in the 

eukaryotic endoplasmic reticulum. In sharp contrast, in Gram-negative bacteria, the 
formation of disulfide bonds in the periplasm is mediated by DsbA, a strong cysteine 
oxidase but an inefficient catalyst of disulfide bond isomerization with no known 
chaperon activity. 

15 The prokaryotic analog was utilized in Eur. Pat. Appl. No. EP 510,658 to direct 

the expression of a single disulfide bond-containing polypeptide, a-amylase/trypsin 
inhibitor (RBI), but unfortunately the method required the addition of critical amounts of 
thiol reagents to the culture medium. The present inventors have shown, however, that 
the prokaryotic enzyme disclosed in Eur. Pat. Appl. No. EP 510,658 was unable to direct 

20 the secretion of significant quantities of either active tPA or active BPTI from bacterial 

host cells. 

Surprisingly, however, when the inventors genetically engineered recombinant 
host cells to contain a eukaryotic foldase, namely, a disulfide isomerase, the mammalian 
enzyme was not only secreted into the periplasmic space of bacterial cells, but was also 

25 able to catalyze the formation of disulfide bonds, and complement several dsbA mutants 

which lacked the bacterial enzyme. The function of rPDI was dependent on the dsbB 
gene suggesting that the reoxidation of this eukaryotic enzyme involves direct 
interactions with bacterial redox proteins. 

Even more importantly, the inventors have demonstrated that co-expression of the 

30 eukaryotic rPDI increased the yield of eukaryotic proteins such as BPTI several fold. 

Whereas PDI is thought to function primarily as an isomerase in the eukaryotic 
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endoplasmic reticulum, rPDI failed to decrease the accumulation of two disulfide folding 
intermediates of BPT1 and thus did not appear to appreciably catalyze the rate limiting 
step in the oxidative folding pathway of BPTI. 

In a breakthrough for protein production in bacterial hosts, the present invention 
now provides novel methods and compositions to facilitate the expression of eukaryotic 
foldases in bacterial cells and provides new expression systems which may now be 
exploited to increase the yield of biologically-valuable eukajotic proteins which have 
not been previously been produced in prokaryotic hosts in biologically-active forms. 

4.5 Methods of Nucleic Acid Delivery and DNA Transformation 

In yet another embodiment, the present invention provides recombinant host cells 
transformed with polynucleotides which encode an eukaryotic foldase, and particular 
disulfide bond-containing polypeptides of interest, as well as transgenic cells derived 
from those transformed or transfected ceils. Preferably, a recombinant host cell of the 
present invention is transformed with a polynucleotide comprising a sequence encoding 
PDI and a polynucleotide comprising a sequence encoding either tPA or PTI. Means of 
transforming cells with exogenous polynucleotide such as DNA molecules are well 
known in the art and include techniques such as calcium-phosphate- or DEAE-dextran- 
mediated transfection, protoplast fusion, electroporation, liposome mediated transfection, 
direct microinjection and adenovirus infection (Sambrook et ai, 1989). 

The application of brief, high-voltage electric pulses to a cell culture leads to the 
formation of nanometer-sized pores in the cell membrane.^DNA is taken directly into the 
cytoplasm either through these pores or as a consequence of the redistribution of 
membrane components that accompanies closure of the pores. Electroporation can be 
extremely efficient and can be used both for transient expression of cloned genes and for 
establishment of cell lines that carry integrated copies of the gene of interest. 
Electroporation, in contrast to calcium chloride-mediated transformation, frequently 
gives rise to high numbers of target cells being transformed with the foreign DNA. 

Liposome transfection involves encapsulation of DNA and RNA within 
liposomes, followed by fusion of the liposomes with the cell membrane. The mechanism 
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of how DNA is delivered into the cell is unclear but transfection efficiencies can be as 
high as 90%. 

4.6 DNA Segments 

In other embodiments, it is contemplated that certain advantages will be gained 
by positioning the DNA segment encoding the fusion polypepti^p under the control of a 
recombinant, or heterologous, promoter. As used herein, a recombinant or heterologous 
promoter is intended to refer to a promoter that is not normally associated with a DNA 
segment encoding the eukaryotic peptide in its natural environment. Such promoters 
may include promoters normally associated with other genes, and/or promoters isolated 
from any viraK prokaryotic (e.g., bacterial), eukaryotic (e.g., fungal, yeast, plant, or 
animal) cell, and particularly those of mammalian cells. Naturally, it will be important to 
employ a promoter that effectively directs the expression of the DNA segment in the cell 
type, organism, or even animal, chosen for expression. The use of promoter and cell type 
combinations for protein expression is generally known to those of skill in the art of 
molecular biology, for example, see Sambrook et ai 9 1989. The promoters employed 
may be constitutive, or inducible, and can be used under the appropriate conditions to 
direct high level expression of the introduced DNA segment, such as is advantageous in 
the large-scale production of recombinant proteins or peptides. Appropriate 
promoter/expression systems contemplated for use in high-level expression include, but 
are not limited to, the Pichia expression vector system (Pharmacia LKB Biotechnology), 
a baculovirus system for expression in insect cells, or any suitable yeast or bacterial 
expression system. 

The ability of nucleic acid segments to be used as probes to specifically hybridize 
to the disclosed DNA sequences will enable them to be of use in detecting the presence 
of complementary sequences in a given sample. However, other uses are envisioned, 
including the use of the sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Nucleic acid molecules having sequence regions consisting of contiguous 
nucleotide stretches of about 14, 15-20, 30, 40, 50, or even of about 100 to about 200 
nucleotides or so, identical or complementary to the DNA sequences disclosed herein, 
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are particularly contemplated as hybridization probes for use in, e.g.. Southern and 
Northern blotting. Smaller fragments will generally find use in hybridization 
embodiments, wherein the length of the contiguous complementary region may be 
varied, such as between about 10 and about 14 or even up to about 25, 50, or 100 
nucleotides, but larger contiguous complementarity stretches may be used, according to 
the length complementary sequences one wishes to detect. 

The use of a hybridization probe of about 14 nucleotides in length allows the 
formation of a duplex molecule that is both stable and selective. Molecules having 
contiguous complementary sequences over stretches greater than 14 bases in length are 
generally preferred, though, in order to increase stability and selectivity of the hybrid, 
and thereby improve the quality and degree of specific hybrid molecules obtained. One 
will generally prefer to design nucleic acid molecules having gene-complementary 
stretches of about 15 to about 25 contiguous nucleotides, or even longer where desired. 

Of course, fragments may also be obtained by other techniques such as, e.g., by 
mechanical shearing or by restriction enzyme digestion. Small nucleic acid segments or 
fragments may be readily prepared by, for example, directly synthesizing the fragment by 
chemical means, as is commonly practiced using an automated oligonucleotide 
synthesizer. Also, fragments may be obtained by application of nucleic acid 
reproduction technology, such as PCR™ (exemplified in U. S. Patents 4,683,202 and 
4,683,195), by introducing selected sequences into recombinant vectors for recombinant 
production, and by other recombinant DNA techniques generally known to those of skill 
in the art of molecular biology. 

Accordingly, the nucleotide sequences of the invention may be used for their 
ability to selectively form duplex molecules with complementary stretches of DNA 
fragments. Depending on the application envisioned, one will desire to employ varying 
conditions of hybridization to achieve varying degrees of selectivity of probe towards 
target sequence. For applications requiring high selectivity, one will typically desire to 
employ relatively stringent conditions to form the hybrids, e.g., one will select relatively 
low salt and/or high temperature conditions, such as provided by about 0.02 M to about 
0.15 M NaCl at temperatures of about 50EC to about 70EC. Such selective conditions 
tolerate little, if any, mismatch between the probe and the template or target strand, and 
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wouJd be particularly suitable for isolating target DNA segments. Detection of DNA 
segments via hybridization is well-known to those of skill in the art, and the teachings of 
U. S. Patents 4,965,188 and 5,176,995 (each incorporated herein by reference) are 
exemplary of the methods of hybridization analyses. Teachings such as those found in 
the texts of Maloy, 1990; Maloy et aL 1994; Segal, 1976; Prokop, 1991; and Kuby, 
1994, are particularly relevant, providing detailed methods andy>rotocols for molecular 
biology methods, hybridization and instruction enzyme digestion, plasmid construction, 
DNA and RNA sequencing mutant construction and analysis and other related methods. 

Of course, for some applications, for example, where one desires to prepare 
mutants employing a mutant primer strand hybridized to an underlying template or where 
one seeks to isolate eukaryotic polypeptide-encoding sequences from related species, 
functional equivalents, or the like, less stringent hybridization conditions will typically 
be needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ conditions such as about 0.15 M to about 0.9 M salt, al 
temperatures ranging from about 20EC to about 5 5 EC. Cross-hybridizing species can 
thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 
more stringent by the addition of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and thus will generally be a method 
of choice depending on the desired results. 

In certain embodiments, it will be advantageous to employ nucleic acid sequences 
of the present invention in combination with an appropriate means, such as a label, for 
determining hybridization. A wide variety of appropriate indicator means are known in 
the art, including fluorescent, radioactive, enzymatic or other ligands, such as 
avidin/biotin, which are capable of giving a detectable signal. In preferred embodiments, 
one will likely desire to employ a fluorescent label or an enzyme tag, such as urease, 
alkaline phosphatase or peroxidase, insi. J of radioactive or other environmental 
undesirable reagents. In the case of enzyme tags, colorimetric indicator substrates are 
known that can be employed to provide a means visible to the human eye or 
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spectrophotometrically, to identify specific hybridization with complementary nucleic 
acid-containing samples. 

. In general, it is envisioned that the hybridization probes described herein will be 
useful both as reagents in solution hybridization as well as in embodiments employing a 
solid phase. In embodiments involving a solid phase, the test DNA (or RNA) is 
adsorbed or otherwise affixed to a selected matrix or surface. This fixed, single-stranded 
nucleic acid is then subjected to specific hybridization with selected probes under desired 
conditions. The selected conditions will depend on the particular circumstances based on 
the particular criteria required (depending, for example, on the G+C content, type of 
target nucleic acid, source of nucleic acid, size of hybridization probe, etc.). Following 
washing of the hybridized surface so as to remove nonspecifically bound probe 
molecules, specific hybridization is detected, or even quantitated, by means of the label. 

4.7 Expression of Eukaryotic Disulfide-Bond Containing Proteins 

The present inventors contemplate cloning the recombinant polypeptides 
identified herein, and in particular recombinant tPA and PTI polypeptides. A technique 
often employed by those skilled in the art of protein production today is to obtain a so- 
called "recombinant" version of the protein, to express it in a recombinant cell and to 
obtain the protein from such cells. These techniques are based upon the "cloning" of a 
DNA molecule encoding the protein from a DNA library, i.e., on obtaining a specific 
DNA molecule distinct from other portions of DNA. This can be achieved by, for 
example, cloning a cDNA molecule, or cloning a ^enomic-like DNA molecule. 
Techniques such as these would also, of course, be appropriate for the production of a 
disulfide bond-containing polypeptide in accordance with the present invention. 

The first step in such cloning procedures is the screening of an appropriate DNA 
library, such as, in the present case, a rat, human, bovine, or other mammalian-derived 
library. The screening procedure may be an expression screening protocol employing 
antibodies directed against the protein, or activity assays. Alternatively, screening may 
be based on the hybridization of oligonucleotide probes, designed from a consideration 
of portions of the amino acid sequence of the protein, or from the DNA sequences of 
genes encoding related proteins. The operation of such screening protocols arc well 
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known to those of skill in the art and are described in detail in the scientific literature, 
e.g., in Sambrook et ai (1989) specifically incorporated herein by reference. Moreover, 
as the present invention encompasses the cloning of genomic segments as well as cDNA 
molecules, it is contemplated that other suitable methods known to those in the art, such 
as, e.g., those described by Spoerel et ai. (1987), may also be used in connection with 
cloning a disulfide bond-containing polypeptide, or alternativel^&n eukaryotic foldase to 
direct the folding and isomerization of disulfide bonds contained within such 
polypeptides of interest. 

After identifying appropriate DNA molecules, they may be inserted into any one 
of the many vectors currently known in the art and transferred to a prokaryotic or 
eukaryotic host cell where it will direct the expression and production of the so-called 
recombinant version of the protein. This is also, of course, routinely practiced in the art 
and described in various publications, such as, e.g.. Sambrook et ai (1989). Such DNA 
segments may be contained on a single plasmid vector, or alternatively, the foldase may 
be encoded by nucleic acid sequences on one vector and the disulfide bond-containing 
polypeptide of interest may be present on a second plasmid vector which is compatible 
for co-residence in a single host cell with the first plasmid vector comprising the foldase 
sequence. The selection of plasmid vectors is well-known to those of skill in the art, and 
such a selection may be based on the incompatibility grouping of such vectors (IncP, 
IncQ, etc.). Virtually any such plasmid vectors may be used in the practice of the 
invention so long as they are replicable in the appropriate prokaryotic host cell 
employed. In one embodiment, preferred replicons include pACYC184 and pTI103, and 
in particular, the pACYCBPTI and pLPPsOmpArPDI plasmid constructs derived 
respectively, therefrom. 

It will be understood that recombinant disulfide bond-containing polypeptides 
may differ from naturally-produced polypeptides in certain ways. In particular, the 
degree of post-translational modifications, such as, for example, glycosylation and 
phosphorylation may be different between the recombinant and natural forms. 

Recombinant clones expressing nucleic acid segments which encode eukaryotic 
disulfide-bond containing polypeptides may be used to prepare purified recombinant 
polypeptides, purified polypeptide-derived antigens as well as mutant or variant 
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recombinant protein species in significant quantities. In particular, the invention 
provides for the production of recombinant tPA (rtPA) or recombinant PT1 (rPTl) rn 
substantial quantities from bacterial host cells. 

Additionally, by application of techniques such as DNA mutagenesis, the present 
invention allows the ready preparation of so-called "second generation" molecules 
having modified or simplified protein structures. Secondgeneration proteins will 
typically share one or more properties in common with the fulHength polypeptides, such 
as a particular antigenic/immunogenic epitopic core sequences, or particular catalytic 
sites, active sites, or ligand binding domains, etc. Epitopic sequences can be provided on 
relatively short molecules prepared from knowledge of the peptide, or encoding DNA 
sequence information. Such variant molecules may not only be derived from selected 
immunogenic/ antigenic regions of the protein structure, but may additionally, or 
alternatively, include one or more functionally equivalent amino acids selected on the 
basis of similarities or even differences with respect to the natural sequence. This is 
particularly desirable in the preparation of recombinant polypeptides having enhanced or 
superior stability, activity, binding, or affinity for substrates and the like. 

The general process of recombinant expression of proteins in bacterial hosts, and 
particularly Gram-negative hosts, is well-known to those of skill in the art. It is generally 
preferred for the methods described herein that the DNA sequence encoding the 
particular eukaryotic protein of interest to be secreted be operativeiy linked to a DNA 
sequence which encodes a signal peptide sufficient for the translocation of the 
recombinant polypeptide to the periplasmic space of the bacterial host cell. As it is well- 
known, operative links between such DNA sequences mean that a translation^ fusion 
exists between the heterologous protein and the signal peptide. As a rule, such signal 
peptides form the N-terminal portion of the secreted heterologous protein. Signal 
sequences which promote protein translocation to the periplasmic space of Gram- 
negative bacterial are well-known, as exemplified by those described herein. The E. coli 
OmpA, Lpp, LamB, MalE, PelB, and StII leader peptide sequences have been 
successfully used in many applications as signal sequences to promote protein secretion 
in bacterial cells such as those used herein, and are all contemplated to be useful in the 
practice of the invention. 
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4.8 Promoters, Enhancers, and Signal Sequence Elements 

The promoters and enhancers that control the transcription of protein-encoding 
genes are composed of multiple genetic elements. The cellular machinery is able to 
5 gather and integrate the regulatory information conveyed by each element, allowing 

different genes to evolve distinct, often complex patterns of tran^riptional regulation. 

The term promoter will be used here to refer to a group of transcriptional control 
modules that are clustered around the initiation site for RNA polymerase II. Much of the 
thinking about how promoters are organized derives from analyses of several viral 

10 promoters, including those for the HSV thymidine kinase (tk) and SV40 early 

transcription units* These studies, augmented by more recent work, have shown that 
promoters are composed of discrete functional modules, each consisting of 
approximately 7-20 bp of DNA, and containing one or more recognition sites for 
transcriptional activator proteins. At least one module in each promoter functions to 

1 5 position the start site for RNA synthesis. The best known example of this is the TATA 

box, but in some promoters lacking a TATA box, such as the promoter for the 
mammalian terminal deoxynucleotidyl transferase gene and the promoter for the SV 40 
late genes, a discrete element overlying the start site itself helps to fix the place of 
initiation. 

20 Additional promoter elements regulate the frequency of transcriptional initiation. 

Typically, these are located in the region 30-1 10 bp upstream of the start site, although a 
number of promoters have recently been shown to ..contain functional elements 
downstream of the start site as well. The spacing between elements is flexible, so that 
promoter function is preserved when elements are inverted or moved relative to one 

25 another. In the tk promoter, the spacing between elements can be increased to 50 bp 

apart before activity begins to decline. Depending on the promoter, it appears that 
individual elements can function either cooperatively or independently to activate 
transcription. 

Enhancers were originally detected as genetic elements that increased 
30 transcription from a promoter located at a distant position on the same molecule of DNA. 
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This ability to act over a large distance had little precedent in classic studies of 
prokaryotic transcriptional regulation. 

Subsequent work showed that regions of DNA with enhancer activity are 
organized much like promoters. That is, they are composed of many individual 
elements, each of which binds to one or more transcriptional proteins. 

The basic distinction between enhancers and promoters is operational. An 
enhancer region as a whole must be able to stimulate transcription at a distance; this need 
not be true of a promoter region or its component elements. On the other hand, a 
promoter must have one or more elements that direct initiation of RNA synthesis at a 
particular site and in a particular orientation, whereas enhancers lack these specificities. 
Aside from this operational distinction, enhancers and promoters are very similar entities. 
They have the same general function of activating transcription in the cell. They are 
often overlapping and contiguous, often seeming to have a very similar modular 
organization. Taken together, these considerations suggest that enhancers and promoters 
are homologous entities and that the transcriptional activator proteins bound to these 
sequences may interact with the cellular transcriptional machinery in fundamentally the 
same way. 

Particularly preferred promoters include the lac-lpp promoter which is well- 
known in the art. Other promoters contemplated to be useful in the practice of the 
invention include the ara, lac, tac, trc. trp, phoA, P BAD , X PL , Ipp, and the T7 promoter. 

4.9 Site-Specific Mutagenesis 

Site-specific mutagenesis is a technique useful in the preparation of individual 
peptides, or biologically functional equivalent proteins or peptides, through specific 
mutagenesis of the underlying DNA. The technique, well-known to those of skill in the 
art, further provides a ready ability to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more 
nucleotide sequence changes into the DNA. Site-specific mutagenesis allows the 
production of mutants through the use of specific oligonucleotide sequences which 
encode the DNA sequence of the desired mutation, as well as a sufficient number of 
adjacent nucleotides, to provide a primer sequence of sufficient size and sequence 
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complexity to form a stable duplex on both sides of the deletion junction being traversed. 
Typically, a primer of about 14 to about 25 nucleotides in length is preferred, with about 
5 to about 10 residues on both sides of the junction of the sequence being altered. 

In general, the technique of site-specific mutagenesis is well known in the art, as 
5 exemplified by various publications. As will be appreciated, the technique typically 

employs a phage vector which exists in both a single stranded ayd double stranded form. 
Typical vectors useful in site-directed mutagenesis include vectors such as the Ml 3 
phage. These phage are readily commercially-available and their use is generally 
well-known to those skilled in the art. Double-stranded plasmids are also routinely 

10 employed in site directed mutagenesis which eliminates the step of transferring the gene 

of interest from a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is performed by first 
obtaining a single-stranded vector or melting apart of two strands of a double-stranded 
vector which includes within its sequence a DNA sequence which encodes the desired 

15 peptide. An oligonucleotide primer bearing the desired mutated sequence is prepared, 

generally synthetically. This primer is then annealed with the single-stranded vector, and 
subjected to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, 
in order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex 
is formed wherein one strand encodes the original non-mutated sequence and the second 

20 strand bears the desired mutation. This heteroduplex vector is then used to transform 

appropriate cells, such as E. coli cells, and clones are selected which include recombinant 
vectors bearing the mutated sequence arrangement. 

The preparation of sequence variants of the selected peptide-encoding DNA 
segments using site-directed mutagenesis is provided as a means of producing potentially 

25 useful species and is not meant to be limiting as there are other ways in which sequence 

variants of peptides and the DNA sequences encoding them may be obtained. For 
example, recombinant vectors encoding the desired peptide sequence may be treated with 
mutagenic agents, such as hydroxy lamine, to obtain sequence variants. Specific details 
regarding these methods and protocols are found in the teachings of Maloy (1990); 

30 Maloy el al (1994); Segal (1976); Prokop and Bajpai (1991); Maniatis et al. (1982); and 

Sambrook et al (1989), each incorporated herein by reference, for that purpose. 
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The PCR™-bascd strand overlap extension (SOE) for site-directed mutagenesis is 
particularly preferred for site-directed mutagenesis of the nucleic acid compositions of 
the present invention. The techniques of PCR™ are well-known to those of skill in the 
art, as described hereinabove. The SOE procedure involves a two-step PCR™ protocol, 
in which a complementary pair of internal primers (B and C) are used to introduce the 
appropriate nucleotide changes into the wild-type sequence. In two separate reactions, 
flanking PCR™ primer A (restriction site incorporated into^he oligo) and primer D 
(restriction site incorporated into the oligo) are used in conjunction with primers B and 
C, respectively to generate PCR™ products AB and CD. The PCR™ products arc 
purified by agarose gel electrophoresis and the two overlapping PCR™ fragments AB 
and CD are combined with flanking primers A and D and used in a second PCR™ 
reaction. The amplified PCR™ product is agarose gel purified, digested with the 
appropriate enzymes, ligated into an expression vector, and transformed into E. coli 
JMlO'l. XL1-Blu e ™ (Stratagene, La Jolla, CA), JM105, TGI (Carter el ai, 1985), or 
other such suitable cells as deemed appropriate depending upon the particular application 
of the invention. Clones are isolated and the mutations are confirmed by sequencing of 
the isolated plasmids. Beginning with the native gene sequences, for example, the 
nucleic acid sequences encoding eukaryotic disulfide-bond-containing polypeptides such 
as PTi or tPA and the like, suitable clones and subclones may be made in the appropriate 
vectors from which site-specific mutagenesis may be performed. 

4.10 Biological Functional Equivalents 

Modification and changes may be made in the structure of the peptides of the 
present invention and DNA segments which encode them and still obtain a functional 
molecule that encodes a protein or peptide with desirable characteristics. The following 
is a discussion based upon changing the amino acids of a protein to create an equivalent, 
or even an improved, second-generation molecule. The amino acid changes may be 
achieved by changing the codons of the DNA sequence, according to Table 1 . 
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TABLE 1 



Amino Acids 



Codons 
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Aspartic acid 
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Glutamic acid 
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Phenylalanine 
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t IT TT I 
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Glycine 


Gly 
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GGA 
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GGG 
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Histidine 
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CAC 
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CAU 






Isoleucine 


He 


I 


AUA 


AUC 


AUU 




Lysine 


Lys 


K 


AAA 


AAG 






Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


Methionine 


Met 


M 


AUG 








Asparagine 


Asn 


N 


AAC 


AAU 






Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 


Glutamine 


Gin 


Q 


CAA 


CAG 






Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


ucc 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 


Valine 


Vai 


V 


GUA 


GUC 


GUG 


GUU 


Tryptophan 


Tip 


W 


UGG 








Tyrosin 


Tyr 


Y 


UAC 


UAU 







CUG CUU 



CGG 
UCG 



CGU 

ucu 



For example, certain amino acids may be substituted for other amino acids in a 
protein structure without appreciable loss of interactive binding capacity with structures 
5 such as, for example, antigen-binding regions of antibodies or binding sites on substrate 

molecules. Since it is the interactive capacity and nature of a protein that defines that 
protein's biological functional activity, certain amino acid sequence substitutions can be 
made in a protein sequence, and, of course, its underlying DNA coding sequence, and 
nevertheless obtain a protein with like properties. It is thus contemplated by the 
10 inventors that various changes may be made in the peptide sequences of the disclosed 
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compositions, or corresponding DNA sequences which encode said peptides without 
appreciable loss of their biological utility or activity. 

In making such changes, the hydropathic index of amino acids may be 
considered. The importance of the hydropathic amino acid index in conferring 
interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporate herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to thele condary structure of the 
resultant protein, which in turn defines the interaction of the protein with other, 
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982), these are: 
isolcucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other amino 
acids having a similar hydropathic index or score and still result in a protein with similar 
biological activity, /.*., still obtain a biological functionally equivalent protein. In 
making such changes, the substitution of amino acids whose hydropathic indices are 
within ±2 is preferred, those which are within ±1 are particularly preferred, and those 
within ±0.5 are even more particularly preferred. It is also understood in the art that the 
substitution of like amino acids can be made effectively^ the basis of hydrophilicity. 
U. S. Patent 4,554,101, incorporated herein by reference, states that the greatest local 
average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent 
amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,101, the following hydrophilicity values have 
been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); 
glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); 
threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); 
methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); 
phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
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substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids whose hydrophilicity values are within ±2 
is preferred, those which are within ±1 are particularly preferred, and those within ±0.5 
5 are even more particularly preferred. 

As outlined above, amino acid substitutions are general]^ therefore based on the 
relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions which 
take various of the foregoing characteristics into consideration are well known to those of 
10 skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 

threonine; glutarnine and asparagine; and valine, leucine and isoleucine. 

4.11 Pharmaceutical Compositions 

In certain embodiments, the inventors contemplate the formulation of the 
15 eukaryotic polypeptides produced in bacteria using the methods disclosed herein into 

pharmaceutically-acceptable compositions for administration to an animal, and in 

particular, a mammal such as a human. 

Such pharmaceutical compositions may be orally administered, for example, 

with an inert diluent or with an assimilable edible carrier, or they may be enclosed in 
20 hard or soft shell gelatin capsule, or they may be compressed into tablets, or they may be 

incorporated directly with the food of the diet. For oral therapeutic administration, the 

active compounds may be incorporated with excipients an* used in the form of ingestible 

tablets, buccal tables, troches, capsules, elixirs, suspensions, syrups, wafers, and the like. 

Such compositions and preparations should contain at least 0.1% of active compound. 
25 The percentage of the compositions and preparations may, of course, be varied and may 

conveniently be between about 2 to about 60% of the weight of the unit. The amount of 

active compounds in such therapeutically useful compositions is such that a suitable 

dosage will be obtained. 

The tablets, troches, pills, capsules and the like may also contain the following: a 
30 binder, as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such as dicalcium 

phosphate; a disintegrating agent, such as corn starch, potato starch, alginic acid and the 
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like; a lubricant, such as magnesium stearate; and a sweetening agent, such as sucrose, 
lactose or saccharin may be added or a flavoring agent, such as peppermint, oil of 
wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may contain, 
in addition to materials of the above type, a liquid carrier. Various other materials may 
be present as coatings or to otherwise modify the physical form of the dosage unit. For 
instance, tablets, pills, or capsules may be coated with shellac, sugar or both. A syrup of 
elixir may contain the active compounds sucrose as a sweJening agent methyl and 
propylparabens as preservatives, a dye and flavoring, such as cherry or orange flavor. Of 
course, any material used in preparing any dosage unit form should be pharmaceutical^ 
pure and substantially non-toxic in the amounts employed. In addition, the active 
compounds may be incorporated into sustained-release preparation and formulations. 

The active compounds may also be administered parenterally or 
intraperitoneally. Solutions of the active compounds as free base or pharmacologically 
acceptable salts can be prepared in water suitably mixed with a surfactant, such as 
hydroxypropylcellulose. Dispersions can also be prepared in glycerol, liquid 
polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of 
storage and use, these preparations contain a preservative to prevent the growth of 
microorganisms. 

The pharmaceutical forms suitable for injectable use include sterile aqueous 
solutions or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersions. In all cases the form must be sterile and must be fluid 
to the extent that easy syringability exists. It must be stable under the conditions of 
manufacture and storage and must be preserved against the contaminating action of 
microorganisms, such as bacteria and fungi. The carrier can be a solvent or dispersion 
medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, 
and vegetable oils. The proper fluidity can be maintained, for example, by the use of a 
coating, such as lecithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of surfactants. The prevention of the action of microorganisms 
can be brought about by various antibacterial and antifungal agents, for example, 
parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it 
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will be preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum monostearatc 
and gelatin. 

5 Sterile injectable solutions are prepared by incorporating the active compounds 

in the required amount in the appropriate solvent with various^bf the other ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, dispersions 
are prepared by incorporating the various sterilized active ingredients into a sterile 
vehicle which contains the basic dispersion medium and the required other ingredients 

10 from those enumerated above. In the case of sterile powders for the preparation of sterile 

injectable solutions, the preferred methods of preparation are vacuum-drying and freeze- 
drying techniques which yield a powder of the active ingredient plus any additional 
desired ingredient from a previously sterile-filtered solution thereof. 

As used herein, "pharmaceutical^ acceptable carrier" includes any and all 

15 solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and 

absorption delaying agents and the like. The use of such media and agents for 
pharmaceutical active substances is well known in the art. Except insofar as any 
conventional media or agent is incompatible with the active ingredient, its use in the 
therapeutic compositions is contemplated. Supplementary active ingredients can also be 

20 incorporated into the compositions. 

For oral prophylaxis the polypeptide may be incorporated with excipients and 
used in the form of non-ingestible mouthwashes and dentifrices. A mouthwash may be 
prepared incorporating the active ingredient in the required amount in an appropriate 
solvent, such as a sodium borate solution (Dobell ! s Solution). Alternatively, the active 

25 ingredient may be incorporated into an antiseptic wash containing sodium borate, 

glycerin and potassium bicarbonate. The active ingredient may also be dispersed in 
dentifrices, including: gels, pastes, powders and slurries. The active ingredient may be 
added in a therapeutically effective amount to a paste dentifrice that may include water, 
binders, abrasives, flavoring agents, foaming agents, and humectants. 

30 The phrase "pharmaceutically-acceptable" refers to molecular entities and 

compositions that do not produce an allergic or similar untoward reaction when 
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administered to a human. The preparation of an aqueous composition that contains a 
protein as an active ingredient is well understood in the art. Typically, such 
compositions are prepared as injectables, either as liquid solutions or suspensions; solid 
forms suitable for solution in, or suspension in, liquid prior to injection can also be 
prepared. The preparation can also be emulsified. 

The composition can be formulated in a neutral or salt form. 
Pharmaceutically-acceptable salts, include the acid addition Kilts (formed with the free 
amino groups of the protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 
histidine, procaine and the like. Upon formulation, solutions will be administered in a 
manner compatible with the dosage formulation and in such amount as is therapeutically 
1 5 effective. The formulations are easily administered in a variety of dosage forms such as 

injectable solutions, drug release capsules and the like. 

For parenteral administration in an aqueous solution, for example, the solution 
should be suitably buffered if necessary and the liquid diluent first rendered isotonic with 
sufficient saline or glucose. These particular aqueous solutions are especially suitable for 
intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this 
connection, sterile aqueous media which can be employed will be known to those of skill 
in the art in light of the present disclosure. For example, one dosage could be dissolved 
in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid 
or injected at the proposed site of infusion, (see for example, "Remington's 
Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570-1580). Some 
variation in dosage will necessarily occur depending on the condition of the subject being 
treated. The person responsible for administration will, in any event, determine the 
appropriate dose for the individual subject. Moreover, for human administration, 
preparations should meet sterility, pyrogenicity, general safety and purity standards as 
30 required by FDA Office of Biologies standards. 



20 
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4.12 Liposomes and Nanocapsules 

In certain aspects, it may be desirable to formulate the novel fusion proteins of 
the present invention into formulations for administration to an animal or other organism. 
The administration of the compositions disclosed herein may be accomplished with 
5 pharmaceutical formulations of these eukaryotic polypeptides in the form of liposomes or 

nanocapsules for either general administration or for specific ^yrgeting to certain areas, 
cells, or tissues of an animal. 

The formation and use of liposomes is generally known to those of skill in the art 
(see e.g., Couvreur et al., 1977; 1988, which describe the use of liposomes and 

1 0 nanocapsules in the targeted antibiotic therapy of intracellular bacterial infections and 

diseases). Recently, liposomes were developed with improved serum stability and 
circulation half-times (Gabizon and Papahadjopoulos, 1988; Allen and Choun, 1987). 

Liposomes have been used successfully with a number of cell types that are 
normally resistant to transfection by other procedures including T cell suspensions, 

15 primary hepatocyte cultures and PC 12 cells (Muller et al. y 1990). In addition, liposomes 

are free of the DN A length constraints that are typical of viral-based delivery systems. 
Liposomes have been used effectively to introduce genes, drugs (Heath and Martin, 
1986; Heath et ai y 1986; Balazsovits et aL, 1989), radiotherapeutic agents (Pikul et ai, 
1987), enzymes (Imaizumi et al. y 1990a; Imaizumi e/a/., 1990b), viruses (Faller and 

20 Baltimore, 1984), transcription factors and allosteric effectors (Nicolau and Gersonde, 

1979) into a variety of cultured cell lines and animals. 

In addition to the teachings of Couvreur et a/^(1977; 1988), the following 
information may be utilized in generating liposomal formulations. Phospholipids can 
form a variety of structures other than liposomes when dispersed in water, depending on 

25 the molar ratio of lipid to water. At low ratios the liposome is the preferred structure. 

The physical characteristics of liposomes depend on pH, ionic strength and the presence 
of divalent cations. Liposomes can show low permeability to ionic and polar substances, 
but at elevated temperatures undergo a phase transition which markedly alters their 
permeability. The phase transition involves a change from a closely packed* ordered 

30 structure, known as the gel state, to a loosely packed, less-ordered structure, known as 
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the fluid state. This occurs at a characteristic phase-transition temperature and results in 
an increase in permeability to ions, sugars and drugs. 

In addition to temperature, exposure to proteins can alter the permeability of 
liposomes. Certain soluble proteins such as cytochrome c bind, deform and penetrate the 
bilayer, thereby causing changes in permeability. Cholesterol inhibits this penetration of 
proteins, apparently by packing the phospholipids more tightjv. It is contemplated that 
the most useful liposome formations for antibiotic and inhiKtor delivery will contain 
cholesterol. 

Targeting is generally not a limitation in terms of the present invention. 
However, should specific targeting be desired, methods are available for this to be 
accomplished. Antibodies may be used to bind to the liposome surface and to direct the 
antibody and its drug contents to specific antigenic receptors located on a particular cell- 
type surface. Carbohydrate determinants (glycoprotein or glycolipid cell-surface 
components that play a role in cell-cell recognition, interaction and adhesion) may also 
be used as recognition sites as they have potential in directing liposomes to particular cell 
types. Mostly, it is contemplated that intravenous injection of liposomal preparations 
would be used, but other routes of administration are also conceivable. 

Alternatively, the invention provides for pharmaceutically-acceptable 
nanocapsule formulations of dopamine receptor agonists. Nanocapsules can generally 
entrap compounds in a stable and reproducible way (Henry-Michelland et al, 1987). To 
avoid side effects due to intracellular polymeric overloading, such ultrafine particles 
(sized around 0.1 :m) should be designed using polymers able to be degraded in vivo. 
Biodegradable polyalkyl-cyanoacrylate nanoparticles that meet these requirements are 
contemplated for use in the present invention, and such particles may be are easily made, 
25 as described (Couvreure/o/., 1984; 1988). 

4.13 Antibody Compositions 

In another aspect, the present invention contemplates an antibody that is 
immunoreactive with one of the recombinant eukaryotic polypeptides obtained by the 
disclosed methods of producing such fusion peptides in a bacterial host. Reference to 
antibodies throughout the specification includes whole polyclonal and monoclonal 
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antibodies, and parts thereof, either alone or conjugated with other moieties. Antibody 
parts include Fab and F(ab) 2 fragments and single chain antibodies. The antibodies may 
be made in vivo in suitable laboratory animals or in vitro using recombinant DNA 
techniques. An antibody can be a polyclonal or a monoclonal antibody. In a preferred 
5 embodiment, an antibody is a polyclonal antibody. 

Means for preparing and characterizing antibodies are -wgj|! known in the art (See, 
e.g., Harlow and Lane, 1988). Briefly, a polyclonal antibody is prepared by immunizing 
an animal with an immunogen comprising a polypeptide of the present invention and 
collecting antisera from that immunized animal. A wide range of animal species can be 
10 used for the production of antisera. Typically an animal used for production of anti- 

antisera is a rabbit, a mouse, a rat, a hamster or a guinea pig. Because of the relatively 
large blood volume of rabbits, a rabbit is a preferred choice for production of polyclonal 
antibodies. 

As is well known in the art, a given composition may vary in its immunogenicity. 

15 It is often necessary therefore to boost the host immune system, as may be achieved by 

coupling a peptide or polypeptide immunogen to a carrier. Exemplary and preferred 
carriers are keyhole limpet hemocyanin (KLH) and bovine serum albumin (BSA). Other 
albumins such as ovalbumin, mouse serum albumin or rabbit serum albumin can also be 
used as carriers. Means for conjugating a polypeptide to a carrier protein are well known 

20 in the art and include glutaraldehyde, /w-maleimidobencoyl-N-hydroxysuccinimide ester, 

carbodiimide and bis-biazotized benzidine. 

As is also well known in the art, the immunogenioity of a particular immunogen 
composition can be enhanced by the use of non-specific stimulators of the immune 
response, known as adjuvants. Exemplary and preferred adjuvants include complete 

25 Freund's adjuvant (a non-specific stimulator of the immune response containing killed 

Mycobacterium tuberculosis), incomplete Freund's adjuvants and aluminum hydroxide 
adjuvant. 

mAbs may be readily prepared through use of well-known techniques, such as 
those exemplified in U. S. Patent 4,196,265, incorporated herein by reference. Typically, 
30 this technique involves immunizing a suitable animal with a selected immunogen 

composition, e.g., a purified or partially purified protein, polypeptide or peptide. The 

-52- 



3NSDOCID: <WO 97381 23A1_I_> 



10 



15 



WO 97/38123 PCT/US97/05636 

immunizing composition is administered in a manner effective to stimulate antibody 
producing cells. Rodents such as mice and rats are preferred animals, however, the use 
of rabbit, sheep frog cells is also possible. The use of rats may provide certain 
advantages (Goding, 1986), but mice are preferred, with the BALB/c mouse being most 
preferred as this is most routinely used and generally gives a higher percentage of stable 
fusions. 

Following immunization, somatic cells with the^potential for producing 
antibodies, specifically B-lymphocytes (B-cells), are selected for use in the mAb 
generating protocol. These cells may be obtained from biopsied spleens, tonsils or 
lymph nodes, or from a peripheral blood sample. Spleen cells and peripheral blood cells 
are preferred, the former because they are a rich source of antibody-producing cells that 
are in the dividing plasmablast stage, and the latter because peripheral blood is easily 
accessible. Often, a panel of animals will have been immunized and the spleen of animal 
with the highest antibody titer will be removed and the spleen lymphocytes obtained by 
homogenizing the spleen with a syringe. Typically, a spleen from an immunized mouse 
contains approximately about 5 H 10 7 to about 2 H 10 s lymphocytes. 

The antibody-producing B lymphocytes from the immunized animal are then 
fused with cells of an immortal myeloma cell, generally one of the same species as the 
animal that was immunized. Myeloma cell lines suited for use in hybridoma-producing 
fusion procedures preferably are non-antibody-producing, have high fusion efficiency, 
and enzyme deficiencies that render then incapable of growing in certain selective media 
which support the growth of only the desired fused cells (hybridomas). 

Any one of a number of myeloma cells may be used, as are known to those of 
skill in the art (Goding, 1986; Campbell, 1984). One preferred murine myeloma cell is 
25 the NS-1 myeloma cell line (also termed P3-NS-l-Ag4-l), which is readily available 

from the NIGMS Human Genetic Mutant Cell Repository by requesting cell line 
repository number GM3573. Another mouse myeloma cell line that may be used is the 
8-azaguanine-resistant mouse murine myeloma SP2/0 non-producer cell line. 

Methods for generating hybrids of antibody-producing spleen or lymph node cells 
and myeloma cells usually comprise mixing somatic cells with myeloma cells in a 2:1 
ratio, though the ratio may vary from about 20:1 to about 1:1, respectively, in the 
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presence of an agent or agents (chemical or electrical) that promote the fusion of cell 
membranes. Fusion methods using Sendai virus have been described (Kohler and " 
Milstein, 1975; 1976), and those using polyethylene glycol (PEG), such as 37% (v/v) 
PEG, by Gefter et ai (1977). The use of electrically induced fusion methods is also 
5 appropriate (Goding, 1986). 

Antibodies, both polyclonal and monoclonal, specific fqj the eukaryotic fusion 
proteins may be prepared using conventional immunization techniques, as will be 
generally known to those of skill in the art. A composition containing antigenic epitopes 
of particular eukaryotic fusion protein can be used to immunize one or more 

10 experimental animals, such as a rabbit or mouse, which will then proceed to produce 

specific antibodies against epitope-containing eukaryotic fusion proteins. Polyclonal 
antisera may be obtained, after allowing time for antibody generation, simply by 
bleeding the animal and preparing serum samples from the whole blood. 

When peptides are used as antigens to raise polyclonal sera, one would expect 

15 considerably less variation in the clonal nature of the sera than if a whole antigen were 

employed. Unfortunately, if incomplete fragments of an epitope are presented, the 
peptide may very well assume multiple (and probably non-native) conformations. As a 
result, even short peptides can produce polyclonal antisera with relatively plural 
specificities and, unfortunately, an antisera that does not react or reacts poorly with the 

20 native molecule. 

Polyclonal antisera according to present invention is produced against peptides 
that are predicted to comprise whole, intact epitopes. It if believed that these epitopes 
are, therefore, more stable in an immunologic sense and thus express a more consistent 
immunologic target for the immune system. Under this model, the number of potential 

25 B-cell clones that will respond to this peptide is considerably smaller and, hence, the 

homogeneity of the resulting sera will be higher. In various embodiments, the present 
invention provides for polyclonal antisera where the clonal ity, i.e., the percentage of 
clone reacting with the same molecular determinant, is at least 80%. Even higher 
clonality - 90%, 95% or greater - is contemplated. 

30 It is proposed that the monoclonal antibodies of the present invention will find 

useful application in standard immunochemical procedures, such as ELISA and Western 
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blot methods, as well as other procedures which may utilize antibody specific to the 
particular epitopes. Additionally, it is proposed that monoclonal antibodies specific io 
the particular eukaryotic fusion peptide may be utilized in other useful applications. For 
example, their use in immunoabsorbent protocols may be useful in purifying native or 
recombinant peptide species or synthetic or natural variants thereof. In general, both 
poly- and monoclonal antibodies against these peptides may be used in a variety of 
embodiments. For example, they may be employed in antux>dy cloning protocols to 
obtain cDNAs or genes encoding the peptides disclosed herein or related proteins. They 
may also be used in inhibition studies to analyze the effects of particular peptides in cells 
or animals. A particularly useful application of such antibodies is in purifying the fusion 
protein, for example, using an antibody affinity column. The operation of all such 
immunological techniques will be known to those of skill in the art in light of the present 
disclosure. 

4.1 4 Detection of Peptide and Antibody Compositions 

It will be further understood that certain of the polypeptides may be present in 
quantities below the detection limits of typical staining procedures such as Coomassie 
brilliant blue or silver staining, which are usually employed in the analysis of 
SDS/PAGE gels, or that their presence may be masked by an inactive polypeptide of 
similar M r . Although not necessary to the routine practice of the present invention, it is 
contemplated that other detection techniques may be employed advantageously in the 
visualization of particular polypeptides of interest. Immunologically-based techniques 
such as Western blotting using enzymatically-, radiolabel-, or fluorescently-tagged 
antibodies described herein are considered to be of particular use in this regard. 
Alternatively, the peptides of the present invention may be detected by using antibodies 
of the present invention in combination with secondary antibodies having affinity for 
such primary antibodies- This secondary antibody may be enzymatically- or 
radiolabeled, or alternatively, fluorescently-, or colloidal gold-tagged. Means for the 
labeling and detection of such two-step secondary antibody techniques are well-known to 
those of skill in the art. 
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4.1 5 Protein Purification Methods 

Further aspects of the present invention concern the purification, and in particular 
embodiments, the substantia] purification, of the recombinant eukaryotic fusion proteins 
produced in bacterial host cells. The phrase "purified protein" as used herein, is intended 
5 to refer to a polypeptide composition* isolatable from the soluble fraction or cell culture 

supernatant of a recombinant bacterial cell, wherein the protein^ purified to any degree 
relative to its naturally-obtainable state, i.e., in this case, relative to its purity within the 
soluble fraction or cell culture supernatant of a recombinant bacterial cell. A purified 
protein, therefore, also refers to isolated protein, free from the environment in which it 

10 may naturally occur. 

Generally, "purified" will refer to a protein composition which has been subjected 
to fractionation to remove various non-polypeptide components, and which composition 
substantially retains its normal ability. For example, in the case of enzymes such as tPA 
or BPTI, that the purified protein retain its biological or enzymatic activity. Where the 

15 term "substantially purified" is used, this will refer to a composition in which F factor 

forms the major component of the composition, such as constituting from about 50% to 
about 60% of the protein in the composition or more. 

Various methods for quantifying the degree of purification of the recombinant 
polypeptides of the present invention will be known to those of skill in the art in light of 

20 the present disclosure. These include, for example, determining the specific activity of 

an active fraction, or assessing the number of polypeptides within a fraction by 
SDS/PAGE analysis. A preferred method for assessing th^ purity of the protein fraction 
is to calculate the specific activity of the fraction, to compare it to the specific activity of 
the initial source (e.g., the soluble fraction or cell culture supernatant, and to thus 

25 calculate the degree of purity, herein assessed by a "-fold purification number." 

The actual units used to represent the amount of inhibitory activity will, of 
course, be dependent upon the particular assay technique chosen to follow the 
purification. For example, in the case of tPA, the inventors prefer to use a commercially- 
available chromogenic assay (Spectrolyse™, American Diagnostics, Inc., Greenwich, 

30 TC). 
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Likewise, in the case of BPTI, the inventors have quantitated the enzyme using 
the ELISA assay as despribed in Section 5. 

As is generally known in the art, to determine the specific activity, one would 
calculate the number of units of activity per milligram of total protein. In the purification 
procedure, the specific activity of the starting material, i.e., of the soluble fraction or 
culture supernatant containing the desired recombinant, would represent the specific 
activity of the protein in its natural state. At each step, one"would generally expect the 
specific activity of the protein to increase above this value, as it is purified relative to its 
natural state. In preferred embodiments, it is contemplated that one would assess the 
degree of purity of a given protein fraction by comparing its specific activity to the 
specific activity of the starting material, and representing this as X-foId purification. The 
use of "-fold purification" is advantageous as the purity of an inhibitory fraction can thus 
be compared to another despite any differences which may exist in the actual units of 
activity or specific activity. 

15 11 is contemplated that the eukaryotic fusion polypeptides of the present invention 

be purified to between about 10-fold and about 200-fold, and preferably, between about 
30-fold and about 150-fold. 

Generally, "purified" will refer to a composition comprising a eukaryotic fusion 
protein which has been subjected to fractionation to remove various non-polypeptide 

20 components such as other cell components. Various techniques suitable for use in 

protein purification will be well known to those of skill in the art. These include, for 
example, precipitation with ammonium sulfate, PEG, antibodies and the like or by heat 
denaturation, followed by centrifugation; chromatography steps such as ion exchange, 
gel filtration, reverse phase, hydroxylapatite and affinity chromatography; isoelectric 

25 focusing; gel electrophoresis; and combinations of such and other techniques. A specific 

example presented herein is the purification of the proteins using gel filtration 
chromatography . 

The preferred purification method disclosed hereinbelow contains several steps 
and represents the best mode presently known by the inventors to prepare a substantially 
30 purified protein. This method is currently preferred as it results in the substantial 

purification of the polypeptide, as assessed by gel filtration, in yields sufficient for 
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further characterization and use. This preferred mode of protein purification involves the 
execution of certain purification steps in the order described hereinbelow. However, as 
is generally known in the art, it is believed that the order of conducting the various 
purification steps may be changed, or that certain steps may be omitted, and still result in 
5 a suitable method for the preparation of a substantially purified recombinant protein. 

As mentioned above, although preferred for use irrccrtajj! embodiments, there is 
no general requirement that the protein always be provided in its most-purified state- 
Indeed, it is contemplated that less substantially purified proteins, which is nonetheless 
enriched in the desired recombinant protein activity relative to the natural state, will have 

10 utility in certain embodiments. For example crude tPA may be employed in assay kits 

for determining the activity of tPA inhibitors. Similarly, BPTI may be employed to 
detect trypsin activity. 

One important technique in the art for the purification of polypeptides is the 
method of affinity chromatography. Affinity chromatography is generally based on the 

15 recognition of a protein by a substance such as a ligand or an antibody. The column 

material may be synthesized by covalently coupling a binding molecule, such as an 
activated dye, for example to an insoluble matrix. The column material is then allowed 
to adsorb the desired substance from solution. Next, the conditions are changed to those 
under which binding does not occur and the substrate is eluted. The requirements for 

20 successful affinity chromatography are: 

1 ) that the matrix must specifically-adsorb the molecules of interest; 

2) that other contaminants remain unadsorbed;-* 

3) that the ligand must be coupled without altering its binding activity; 

4) that the ligand must bind sufficiently tight to the matrix; and 

25 5) that it must be possible to elute the molecules of interest without 

destroying them. 

A preferred embodiment of the present invention is an affinity chromatography 
method for purification of the recombinant eukaryotic polypeptides from solution 
(including cell culture supernatant, and cell soluble extracts) wherein the matrix contains 
30 an antibody specific for the particular polypeptide, covalently-coupled to a Sepharose 

CL6B or CL4B. Such an affinity matrix would then bind the polypeptides of the present 

-58- 



JNSDOCID: <WQ_9738123A1_I_> 



10 



15 



20 



25 



30 



WO 97/38123 PCI7US97/05636 

invention directly and allows their separation by elution with an appropriate gradient 
such as a buffer, salt, GuHCJ, pH, or urea. 

4. J 6 Chromatography and SDS-PAGE of the Recombinant Polypeptides 

The recombinant polypeptides of the present invention may be particularly 
characterized based on a number of physical, chemical, and biophysical properties. For 
example, the molecular weight of a given protein may be determined using conventional 
means known to those of skill in the art, e.g., as determined by gel filtration column 
chromatography. 

Gel filtration chromatography is a means of determining molecular weight of 
protein species, and is a well-known technique. In general, a preferred gel to be used in 
the procedures of the present invention is a three dimensional network which has a 
random structure. Molecular sieve gels consist of cross-linked polymers that do not bind 
or react with the material being analyzed or separated. For gel filtration purposes, the gel 
material is generally uncharged. The space within the gel is filled with liquid and the 
liquid phase constitutes the majority of the gel volume. Materials commonly used in gel 
filtration columns include dextran, agarose and polyacrylamide. 

Dextran is a polysaccharide composed of glucose residues and is commercially 
available under the names Sephadex (Phamacia Fine Chemicals, Inc.). The beads are 
prepared with various degrees of cross-linking in order to separate different sized 
molecules by providing various pore sizes. The size of the cross-linking molecule can 
also be increased to obtain larger pore sizes. Alky] dextran is cross-linked with N, N'- 
methylenebisacrylarnide to from Sephacryl-S300 which allows strong beads to be made 
that fractionate in larger ranges than Sephadex can achieve. 

The most preferred method of gel filtration in the present invention is agarose gel 
filtration. Agarose is a linear polymer of D-galactose and 3,6 anhydro-1 -galactose and 
the gel polymer is formed by hydrogen bonds. In gel filtration applications the agarose is 
provided as porous beads and the concentration of agarose determines pore size. This 
type of gel is useful for the separation of large, globular molecules such as proteins and 
for long linear molecules such as DNA. Agarose is commercially available under the 
name Sepharose (Sigma) in several pore sizes. In the procedures of the present 
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invention, Scpharose 4B which fractionates in the molecular weight range of 3 x 10 b to 3 
x 1 0 6 is the most preferred agarose preparation. 

Polyacrylamide is a polymer of cross-linked acrylamide prepared with N, N- 
methylenebisacrylamide as the cross-linking agent. Polyacrylamide is available in a 

5 variety of pore sizes from Bio-Rad Laboratories (USA) to be used for separation of 

different size particles. - j£ 

The gel material swell in water and in a few organic solvents. Swelling is the 
process by which the pores become filled with liquid to be used as eluant. As the smaller 
molecules enter the pores, their progress through the gel is retarded relative to the larger 

10 molecules which do not enter the pores. This is the basis of the separation. The beads 

are available in various degrees of fineness to be used in different applications. The 
coarser the bead, the faster the flow and the poorer the resolution. Superfine is to be used 
for maximum resolution, but the flow is very slow. Fine is used for preparative work in 
large columns which require a faster flow rate. The coarser grades are for large 

15 preparations in which resolution is less important than time, or for separation of 

molecules with a large difference in molecular weights. 

However, it is, of course, generally understood by those of skill in the art that 
both the migration of a polypeptide using SDS/PAGE, and the mobility of a polypeptide 
using different sizing columns can vary with different experimental conditions (Capaldi 

20 et al, 1977). It will therefore be appreciated that under differing electrophoretic and 

chromatographic conditions, the molecular weight assignments quoted above may vary. 

It will be further understood that certain of the polypeptides may be present in 
quantities below the detection limits of the Coornassie brilliant blue staining procedure 
usually employed in the analysis of SDS/PAGE gels, or that their presence may be 

25 masked by an inactive polypeptide of similar M r . Although not necessary to the routine 

practice of the present invention, it is contemplated that other detection techniques may 
be employed advantageously in the visualization of each of the polypeptides present 
within the growth factor. Immunologically-based techniques such as Western blotting 
using enzymatically-, radiolabel-, or fluorescently-tagged secondary antibodies are 

30 considered to be of particular use in this regard. 
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4.17 Modes for Carrying Out the Invention 

In the process herein, expression of the pdi gene is induced just before 
(immediately prior to) heterologous gene expression. The heterologous eukaryotic 
polypeptide and the PDI or rPDI protein arc both secreted into the periplasm or the 
heterologous polypeptide is secreted into the culture medium of the bacteria into which 
nucleic acid encoding these polypeptides has been- introduced. Preferably, the 
polypeptide is recovered from the periplasm of the bacteria. 

The pdi nucleic acid may be from any eukaryotic source, but preferably human, 
rat, or yeast, and is generally the native sequence. It is suitably separately placed from 
the nucleic acid encoding the heterologous polypeptide if nucleic acids are on the same 
vector, i.e., they are not linked. In addition, the nucleic acid encoding PDI and the 
nucleic acid encoding the heterologous polypeptide are under separate, different 
inducible promoters so that induction of expression can occur in the required sequential 
order. The nucleic acid encoding PDI and the nucleic acid encoding the heterologous 
polypeptide may be integrated into the host cell genome or contained on autonomously 
replicating plasmids. 

In one alternative, the recombinant host cell comprises two separate vectors 
respectively containing the nucleic acid encoding PDI and the nucleic acid encoding the 
heterologous polypeptide. 

In another alternative, the nucleic acid encoding PDI and the nucleic acid 
encoding the heterologous polypeptide are contained on the same vector but are under 
the control of separate inducible promoters and separate signal sequences. 

The heterologous nucleic acid (e.g., cDNA or genomic DNA) is suitably inserted 
into a replicable vector for expression in the bacterium under the control of a suitable 
promoter for bacteria. Many vectors are available for this purpose, and selection of the 
appropriate vector will depend mainly on the size of the nucleic acid to be inserted into 
the vector and the particular host cell to be transformed with the vector. Each vector 
contains various components depending on its function (amplification of DNA or 
expression of DNA) and the particular host cell with which it is compatible. The vector 
components for bacterial transformation generally include, but are not limited to, one or 
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more of the following: a signal sequence, an origin of replication, one or more marker 
genes, and an inducible promoter. 

In general, plasmid vectors containing replicon and control sequences that are 
derived from species compatible with the host cell are used in connection with bacterial 
5 hosts. The vector ordinarily carries a replication site, as well as marking sequences that 

are capable of providing phenotypic selection in transformed eej^p. 

The DNA encoding the polypeptide of interest herein may be expressed not only 
directly,, but also as a fusion with another polypeptide, preferably a signal sequence or 
other polypeptide having a specific cleavage site at the N-terminus of the mature 

10 polypeptide. In general, the signal sequence may be a component of the vector, or it may 

be a part of the polypeptide DNA that is inserted into the vector. The heterologous signal 
sequence selected should be one that is recognized and processed (i.e., cleaved by a 
signal peptidase) by the host cell. For bacterial host cells that do not recognize and 
process the native polypeptide signal sequence, the signal sequence is substituted by a 

15 bacterial signal sequence selected, for example, from the group consisting of the alkaline 

phosphatase, penicillinase, lpp 9 or heat-stable enterotoxin II leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables 
the vector to replicate in one or more selected host cells. Generally, in cloning vectors 
this sequence is one that enables the vector to replicate independently of the host 

20 chromosomal DNA, and includes origins of replication or autonomously replicating 

sequences. Such sequences are well known for a variety of bacteria. The origin of 
replication from either pBR322 or pACYC184 is suitable for most Gram-negative 
bacteria. 

Expression and cloning vectors also generally contain a selection gene, also 
25 termed a selectable marker. This gene encodes a protein necessary for the survival or 

growth of transformed host cells grown in a selective culture medium. Host cells not 
transformed with the vector containing the selection gene will not survive in the culture 
medium. Typical selection genes encode proteins that (a) confer resistance to antibiotics 
or other toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement 
30 auxotrophic deficiencies, or (c) supply critical nutrients not available from complex 

media, e.g.^ the gene encoding D-alanine racemase for Bacilli. One example of a 
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selection scheme utilizes a drug to arrest growth of a host cell. Those cells that are 
successfully transformed with a heterologous gene produce a protein conferring drug 
resistance and thus survive the selection regimen. 

The expression vector for producing a heterologous polypeptide also contains an 
inducible promoter that is recognized by the host bacterial organism and is operably 
linked to the nucleic acid encoding the polypeptide of interest. It also contains a separate 
inducible promoter operably linked to the nucleic acid encoding PDI. Inducible 
promoters suitable for use with bacterial hosts include the b-lactamase and lactose (lac) 
promoter systems (Chang et al., 1978; Goeddel et al, 1979), the arabinose (ara) 
promoter system (Guzman et al, 1992), alkaline phosphatase (phoA), a tryptophan (trp) 
promoter system (Goeddel, 1980; Eur. Pat. Appl. Publ. No. EP 36,776), X PL promoter, 
and hybrid promoters such as the tac promoter (deBoer et al, 1983). However, other 
known bacterial inducible promoters are suitable. Their nucleotide sequences have been 
published, thereby enabling a skilled worker operably to ligate them to DNA encoding 
15 the polypeptide of interest or to the pdi gene (Siebenlist et al., 1980) using linkers or 

adaptors to supply any required restriction sites. 

Promoters for use in bacterial systems also generally contain a Shine-Dalgamo 
(S.D.) sequence operably linked to the DNA encoding the polypeptide of interest. The 
promoter can be removed from the bacterial source DNA by restriction enzyme digestion 
and inserted into the vector containing the desired DNA. Construction of suitable 
vectors containing one or more of the above-listed components employs standard ligation 
techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and re-ligated in 
the form desired to generate the plasmids required. 

For analysis to confirm correct sequences in plasmids constructed, the ligation 
mixtures are used to transform E. coli K12 strain 294 (ATCC 31446), SF103, SF1 10, 
UT5600, RB791, or any other suitable strain, and successful transformants are selected 
using antibiotic resistance where appropriate. Plasmids from the transformants are 
prepared, analyzed by restriction endonuclease digestion, and/or sequenced by the 
method of Sanger et al. (1977) or Messing et al. (1981) or by the method of Maxam et al. 
30 (1980). 
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Suitable bacteria for this purpose include Archaebacteria and Eubacteria, 
especially Eubacteria, and most preferably Enterobacteriaceae. Examples of useful 
bacteria include Escherichia, Enterobacter, Azotobacter, Erwinia, Bacillus, 
Pseudomonas, Klebsiella, Proteus, Salmonella, Serratia, Shigella, Rhizobia? Vitreoscilla, 
and Paracoccus. Suitable E. coli hosts include E. coli SF103, SF1 10, UT5600, RB791, 
W31 10 (ATCC 27325), £. coli 294 (ATCC 31446), E. coli B^d £ coli x' 776 (ATCC 
31537). These examples are illustrative rather than limiting. Mutant cells of any of the 
above-mentioned bacteria may also be employed. It is, of course, necessary to select the 
appropriate bacteria taking into consideration replicability of the replicon in the cells of a 
bacterium. For example, E. coli, Serratia, or Salmonella species can be suitably used as 
the host when well known plasmids such as pBR322, pBR325, pACYC177, or pKN410 
are used to supply the replicon. 

E. coli strains such as ATCC XXXXX, or those disclosed in U. S. Patent 
5,508,192 (SF103, SF110, UT5600, and RB791) are preferred hosts or parent hosts for 
the practice of the invention, because they are protease deficient recombinantly 
engineered host cells which provide excellent recovery of recombinant polypeptides. 
Preferably, the host cell should secrete minimal amounts of proteolytic enzymes. 
Alternatively, other E. coli strains, for example, E. coli strain W3 1 1 0, may be modified 
to effect a genetic mutation in the genes encoding proteins, with examples of such hosts 
including E. coli W3110 strain 1A2, which has the complete genotype tonAD; E. coli 
W31 10 strain 9E4, which has the complete genotype tonAD ptr3; E. coli W31 10 strain 
27C7 (ATCC 55244), which has the complete genotype tonAD ptr3 phoADElS D(argF- 
lac) J 69 ompTD degP41kari\ E. coli W3110 strain 37D6, which has the complete 
genotype tonAD ptr3 phoADElS D(argF-lac) 1 69 ompTD degP41kan rbs7D ilvG; E. 
coli W3110 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP 
deletion mutation; £. coli W3110 strain 33D3, which has the complete genotype tonA 
ptr3 laclq LacL8 ompT degP kan \ E. coli W3110 strain 36F8, which has the complete 
genotype tonA phoA D(argF-lac) ptr3 degP kan K //vG + , and is temperature resistant at 
37°C; and an E. coli strain having the mutant periplasmic protease(s) disclosed in U. S. 
Patent 4,946,783. 
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Host cells are transfected and preferably transformed with the above-described 
expression vectors and cultured in conventional nutrient media modified as appropriate 
for inducing promoters, selecting transformants, or amplifying the genes encoding the 
desired sequences. 

Transfection refers to the taking up of an expression vector by a host cell whether 
or not any coding sequences are in fact expressed. Numerous methods of transfection are 
known to the ordinarily skilled artisan, for example, Ca>0 4 and electroporation. 
Successful transfection is generally recognized when any indication of the operation of 
this vector occurs within the host cell. 

Transformation means introducing DNA into an organism so that the DNA is 
replicable, either as an extrachromosomal element or by chromosomal integrant. 
Depending on the host cell used, transformation is done using standard techniques 
appropriate to such cells. The calcium treatment employing calcium chloride, as 
described in section 1.82 of Sambrook et ai, (1989), is generally used for bacterial cells 
that contain substantial cell-wall barriers. Another method for transformation employs 
polyethylene glycoI/DMSO, as described in Chung and Miller (1988). Yet another 
method is the use of the technique termed electroporation. 

Bacterial cells used to produce the polypeptide of interest for purposes of this 
invention are cultured in suitable media in which the promoters for the nucleic acid 
encoding the heterologous polypeptide and for the nucleic acid encoding PDI can be 
artificially induced as described generally, e.g., in Sambrook et al. (1989). Examples of 
suitable media are given in U. S. Patents 5,304,472 and 5,342,763. 

Any necessary supplements besides carbon, nitrogen, and inorganic phosphate 
sources may also be included at appropriate concentrations introduced alone or as a 
mixture with another supplement or medium such as a complex nitrogen source. The pH 
of the medium may be any pH from about 5-9, depending mainly on the host organism. 
Optionally the culture medium may contain one or more reducing agents selected from 
the group consisting of glutathione, cysteine, cystamine, thioglycollate, dithioerythritol, 
dithiothreitol and dithioerythritol. Preferably, the bacteria are not cultured so as to over- 
express nucleic acid encoding the heat-shock transcription factor, RpoH. 
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For induction, typically the cells are cultured until a certain optical density is 
achieved, e.g., a A 550 of about 60-80, at which point induction is initiated (e.g., by 
addition of an inducer, by depletion of a medium component, etc.), to induce expression 
of the pdi gene. When the optical density reaches a higher amount, e.g., a A 550 of about 
80-100, induction of the second promoter for the heterologous polypeptide is effected. 

Gene expression may be measured in a sample du^btly, for example, by 
conventional northern blotting to quantitate the transcription of mRNA (Thomas, 1980). 
Various labels may be employed, most commonly radioisotopes, particularly 32 P. 
However, other techniques may also be employed, such as using biotin-modified 
nucleotides for introduction into a polynucleotide. The biotin then serves as the site for 
binding to avidin or antibodies, which may be labeled with a wide variety of labels, such 
as radionuclides, fluorescers, enzymes, or the like. 

Procedures for observing whether an expressed or over-expressed gene product is 
secreted are readily available to the skilled practitioner. Once the culture medium is 
separated from the host cells, for example, by centrifugation or filtration, the gene 
product can then be detected in the cell -free culture medium by taking advantage of 
known properties characteristic of the gene product. Such properties can include the 
distinct immunological, enzymatic, or physical properties of the gene product. 

For example, if an over-expressed gene product has a unique enzyme activity, an 
assay for that activity can be performed on the culture medium used by the host cells. 
Moreover, when antibodies reactive against a given gene product are available, such 
antibodies can be used to detect the gene product in any*known immunological assay 
(e.g., as in Harlow and Lane, 1988). 

The secreted gene product can also be detected using tests that distinguish 
polypeptides on the basis of characteristic physical properties such as molecular weight. 
To detect the physical properties of the gene product, all polypeptides newly synthesized 
by the host cell can be labeled, e.g., with a radioisotope. Common radioisotopes that can 
be used to label polypeptides synthesized within a host cell include tritium ( 3 H), carbon- 
14 ( ,4 C), sulfur-35 (^S), and the like. For example, the host cell can be grown in 35 S- 
methionine or 35 S-cysteine medium, and a significant amount of the 35 S label will be 
preferentially incorporated into any newly synthesized polypeptide, including the over- 
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expressed heterologous polypeptide. The 35 S-containing culture medium is then 
removed and the cells are washed and placed in fresh non-radioactive culture medium. 
After the cells are maintained in the fresh medium for a time and under conditions 
sufficient to allow secretion of the ^S-radiolabeled expressed heterologous polypeptide, 
the culture medium is collected and separated from the host cells. The molecular weight 
of the secreted, labeled polypeptide in the culture medium ^n then be determined by 
known procedures, e.g., polyacrylamide gel electrophoresis. Such procedures, and/or 
other procedures for detecting secreted gene products, are provided in Goeddel (1990), 
and Sambrook et al (1989). 

For secretion of an expressed or over-expressed gene product, the host cell is 
cultured under conditions sufficient for secretion of the gene product. Such conditions 
include, e.g., temperature, nutrient, and cell density conditions that permit secretion by 
the cell. Moreover, such conditions are those under which the cell can perform basic 
cellular functions of transcription, translation, and passage of proteins from one cellular 
compartment to another, as are known to those skilled in the art. 

In practicing the process of this invention, the yield of total polypeptide is 
generally increased, while yield of insoluble polypeptide is not changed or is decreased, 
i.e., yield of soluble polypeptide is increased. 

The polypeptide of interest is recovered from the periplasm or culture medium as 
a secreted soluble polypeptide. It is often preferred to purify the polypeptide of interest 
from recombinant cell proteins or polypeptides and from PDI to obtain preparations that 
are substantially homogeneous as to the polypeptide of interest. As a first step, the 
culture medium or lysate is centrifuged to remove particulate cell debris. The membrane 
and soluble protein fractions may then be separated if necessary. The polypeptide may 
then be purified from the soluble protein fraction and from the membrane fraction of the 
culture lysate, depending on whether the polypeptide is membrane associated or, more 
preferably, completely soluble in the periplasm or culture supernatant.. The polypeptide 
thereafter may be further solubilized and/or refolded, if necessary, and purified from 
contaminant soluble proteins and polypeptides. 

The types of phase-forming species to employ herein depend on many factors, 
including the type of polypeptide and the ingredients in the fermentation broth being 
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treated. The species must be selected so that the polypeptide does not precipitate and one 
phase is more hydrophobic than the other phase so that the polypeptide will be located in 
the more hydrophobic phase and the biomass solids and nucleic acids will settle to the 
less hydrophobic phase. 

The phase-forming species may be a combination of agents, including polymer 
combinations (polymer-polymer), polymer-salt combinations,^ vent-salt, and polymer- 
solvent combinations. Suitable polymers are both highly hydrophilic polymers and less 
hydrophilic polymers, Le.* any phase-forming polymers that are known in the art. 
Examples include polyethylene glycol or derivatives thereof, including various molecular 
weights of PEG such as PEG 4000, PEG 6000, and PEG 8000, derivatives of PEG 
described, for example, in Grunfeld et al (1992), polyvinylpyrrolidone (PVP), in a 
preferable molecular weight range of about 36,000 to 360,000, starches such as dextran 
(e.g., dextran 70 and 500), dextrins, and maltodextrins (preferable molecular weight 
between about 600 and 5,000), sucrose, and Ficoll-400™ polymer (a copolymer of 
sucrose and epichlorohydrin). The preferred polymer herein is polyethylene glycol, 
polypropylene glycol, polyvinylpyrrolidone, or a polysaccharide such as a dextran. The 
most preferred polymer herein is PEG of different molecular weights or a PEG- 
polypropylene glycol combination or copolymer. 

Examples of suitable organic solvents include ethylene glycol, glycerol, dimethyl 
sulfoxide, polyvinylalcohol, dimethylformamide, dioxane, and alcohols such as 
methanol, ethanol, and 2-propanol. Such solvents are such that, when added to aqueous 
solution, they increase the hydrophobicity of the solution. 

The salts can be inorganic or organic and preferably do not act to precipitate the 
polypeptide. Salts containing transition elements are not preferred as they tend to 
precipitate the polypeptide. Anions are selected that have the potential for forming 
aqueous multiple-phase systems. Examples include ammonium sulfate, sodium dibasic 
phosphate, sodium sulfate, ammonium phosphate, potassium citrate, magnesium 
phosphate, sodium phosphate, calcium phosphate, potassium phosphate, potassium 
sulfate, magnesium sulfate, calcium sulfate, sodium citrate, manganese sulfate, 
manganese phosphate, etc. Types of salts that are useful in forming bi-phasic aqueous 
systems are evaluated more fully by Zaslavskii et al. (1988). Preferred salts herein are 
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sulfates, phosphates, or citrates and are alkali or alkaline earth metals. More preferred 
are sulfates and citrates, and most preferred are sulfates since there are fewer pH 
limitations with sulfates. The most preferred salts herein are sodium sulfate and sodium 
citrate. 

The amounts of phase-forming species to add to the polypeptide of interest to 
obtain a satisfactory multiple- phase system arc those known in the art. The amount of 
phase-forming species added to the polypeptide will depenf on such factors as, for 
example, the amount of chaotropic agent and reducing agent, if any, already present in 
the fermentation broth, the nature of the cell culture media, the type of cells used in the 
fermentation, the type of polypeptide being treated, whether the polypeptide will be 
recovered from the lower or upper phase, and the type(s) of phase-forming species being 
added. The general concentration of polymer employed is about 5% (w/w) up to the 
limit of solubility for the polymer and the concentration of salt employed is about 3% 
(w/w) up to the limit of solubility for the salt, depending on the size of the phase-volume 
ratio needed. The phase-volume ratio must be sufficient to accommodate the biomass 
solids. The types and amounts of phase-forming species that are effective can be 
determined by phase diagrams and by evaluating the final result, i.e., the degree of purity 
and the yield of the polypeptide of interest. If the phase-forming species are a polymer- 
salt combination, preferably the concentration of salt added is about 4-15% (wtVwt.) and 
the concentration of polymer is 5-18% (wtVwt.) so that the desired polypeptide will be in 
an opposite phase from that in which the biomass solids and nucleic acids are present. 

If the system desired is one where the polypeptidejs distributed in the top phase 
and the biomass solids and nucleic acids are in the bottom phase, then there is a window 
of concentrations of phase-forming species. When higher amounts of chaotropic agent 
are added to maintain solubilization, the higher the amount of phase-forming species 
required. However, a high concentration of all these reagents will increase the density of 
the solution. A high density will cause the biomass solids to settle less readily. An 
overly high density will cause biomass solids to float on the surface. Hence, the 
concentrations of chaotropic agent and phase-forming species must be sufficiently high 
to maintain a fully solubilized polypeptide, but low enough to allow the biomass solids to 
sediment to the opposite (lower) phase. 
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If the polypeptide is to be recovered in the upper phase, typically the salt 
concentration will be about 4-7% (wt./wt.) and the polymer concentration will be about 
12-18% (wt./wt), depending, e.g., on the type of salt, polymer, and polypeptide. If an 
organic solvent is added as a phase-forming species, such as ethanoL it is preferably 
added in a concentration of about 10 to 30% (vol./vol.) of the solution, depending, e.g., 
on the type of polypeptide and alcohol and if any other phase^rming species is present, 
preferably at a concentration of about 20% (vol ./vol.). 

The exact conditions for contacting the cell culture with the various reagents will 
depend on, e.g., the pH of the buffer, the types of phase-forming reagents, and the types 
and concentrations of polypeptide and chaotropic and reducing agents. The reaction 
temperature is generally about 20 to about 40°C, more preferably room temperature. The 
contacting step will generally be carried out for at least about 30 min., preferably about 
30 min. to about 12 hr depending on whether side-reactions will occur, more preferably 
about 30 min. to about 8 hr, and most preferably about 30 min. to about 1 .5 hr. 

Once the multiple-phase system is established, one phase will be enriched in the 
polypeptide and depleted in the disrupted particles and cells comprising the biomass 
solids and nucleic acids. In a two-phase system, preferably the top phase is enriched in 
the polypeptide whereas the bottom phase is enriched in the disrupted particles and cells. 
The polypeptide can be easily recovered by separation of the phases. This recovery step 
may be accomplished by decanting the upper phase, by draining the lower phase, or by 
centrifugation. The polypeptide can then be isolated from the phase in which it is 
contained by changing the pH of the phase so as to pfecipitate the polypeptide or by 
adding a suitable solvent, whereupon the precipitated polypeptide is suitably recovered 
by centrifugation or filtration or as a slurry. Alternatively, the polypeptide can be 
recovered from the polymer-containing phase by re-extraction by addition of a suitable 
polymer, salt, or solvent. The PDI protein may also be separated from the recombinant 
tPA or PTI polypeptide at this stage. 

Once obtained from the liquid phase of the multiple-phase system, or at a later 
stage of purification, the polypeptide may be suitably stored in an appropriate buffer. 
The buffer can be any buffer known to those of skill in the art to preserve the biological 
activity and integrity of the isolated recombinant polypeptide. Such buffers include those 
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listed below in Section 5, or alternatively, CAPSO, glycine, CAPS, MOPS, HEPES, etc. 
may be employed preferably at a pH of from between about pH 6 and pl l 11, particularly 
at a concentration of about 20 mM. The polypeptide may be diluted with the buffer, or 
alternatively, the polypeptide may be dialyzed against fresh buffer. 

5. Examples 

The following examples are included to demonstrate preferred embodiments of 
the invention. It should be appreciated by those of skill in the art that the techniques 
disclosed in the examples which follow represent techniques discovered by the inventors 
to function well in the practice of the invention, and thus can be considered to constitute 
preferred modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 

5.1 Example 1 - Production of the E. coli Host Cells BPTI in Bacteria 

The present work shows that rat rPDI expressed in the E. coli periplasmic space is 
able to catalyze the formation of disulfide bonds in bacterial proteins, and to complement 
several of the phenotypes of dsbA mutants. Expression of rPDI in the E. coli periplasm 
enhances expression of the recombinant multi-disulfide pancreatic trypsin inhibitor, 
BPTI. 

5.1.1 Materials and Methods 

5.1.1.1 Bacterial Strains and Plasmids 

The E. coli K12 strains used were JCB570 [MCI 000 phoR zihI2::TnJ0], JCB571 
[JCB570 dsbAr.kanl], JCB789 [JCB570 dsbBi.kan], JCB758 [JCB570 dsbA::kan 
dsbB::kan], JCB502[(lD69% lacZiiTnIO (Tet s by fusaric acid)] and JCB572 [JCB502 
dsbA::ka n l](Bardv,elletal., 1991; Bardwell et al., 1993). The last two strains contained 
F'\proAB, laclq, lacZ AMI 5, TnJO]. Plasmid pTI103 contains the OmpA leader-BPTI 
gene fusion and has been described previously (Goldenberg, 1988). Plasmid 
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pLPPsOmpArPDI contains the gene for the mature rat PDK fused to the OmpA signal 
sequence, under the control of the Ipp-Iac promoter (De Sutter et ah, 1994). 
pACYCBPTI contains the OmpA leader-BPTl gene fusion and the origin of replication 
ofpACYC184. 

5.1.1.2 General Methods - ^ 

Unless otherwise specified, cells were grown at 37°C in M9 minimal salts media, 
adjusted to pH 7.0, and supplemented with 0.2% glucose, and 0.2% casein. For labeling 
studies, cultures were supplemented with 50 |^g/ml L-amino acids [except cysteine and 
methionine] instead of casein. Ampicillin (Amp) (50 j-tg/ml) and/or chloramphenicol 
(Cam) (40 |ig/ml) was added as required. In the BPTI production studies, 100 ng/ml 
ampicillin and 1 70 fag/ml chloramphenicol were used to maintain the co-transformants. 

Fractionation by osmotic shock was carried out essentially as described by Neu 
and Heppel (1965). Sensitivity to filamentous phages was tested by diluting overnight 
cultures, grown without IPTG, to an OD 600nm = 0.005, followed by infection with phage 
JB4 (a Cam R , Ml 3 derivative). Subsequently the cells were plated on LB with 0.2% 
glucose and 20 [ig/ml Cam. Conjugation studies using SF1 03 (F AlacX74 glaE galK thi 
rpsL(strA) A phoA (PvuII) /7/r-J2::QCam R ) as the recipient strain were conducted as 
described (Silhavy et aL 9 1983). Rabbit polyclonal antisera against native BPTI 
(Boehringer-Mannheim) and rPDl were prepared using standard protocols (Ausubel et 
aL, 1989). 

5.1.1.3 Pulse-Chase Studies, Immunoprecipitation and Electrophoresis 

For monitoring the oxidative state of alkaline phosphatase and OmpA, 
mid-exponential phase cells were labeled with 100 jj.Ci/ml Trans 35 S Label (ICN 
Biomedicals Inc.) for 45 sec and chased with 20 mM methionine and 3 mM cysteine. 
Samples (1 ml) were withdrawn at various times and added to trichloroacetic acid on ice 
at a final concentration of 1 0%. The protein pellets obtained by centrifugation at 12,000 
x g for 10 min were resuspended in 0.5 ml of 100 mM Tris-HCl (pH 9.0), 1.5% SDS, 5 
mM EDTA and 35 mM iodoacetamide. Samples were then diluted 4-fold in 
immunoprecipitation buffer (10 mM Tris-HCl, pH 8.0, 0.1% Triton X-100®, 0.14 M 
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NaCl. and 0.025% NaN 3 ) and immunoprecipitated with antisera to alkaline phosphatase 
(from 5' Prime 3' Prime. Boulder, CO) and OmpA as previously described (Ostermeier 
and Georgiou, 1994). Oxidized and reduced proteins were resolved by SDS-PAGE in 20 
cm non-reducing gels essentially as described (Pollitt and Zalkin, 1983). Pulse-chase 
5 studies for following the kinetics of folding of BPT1 were carried out as previously 

described (Ostermeier and Georgiou, 1 994) except the chase contained 3 mM cysteine. 

S.l.1.4 Detection of BPTI by EL1SA 

Cells were induced with 0.1 mM IPTG.at OD^ = 0.3-0.35, and, for some 
1 0 samples, GSH and/or GSSG was added twenty min later at the concentrations indicated. 

Five hours after induction, samples were frozen at -70°C, then thawed to 4°C, lysed by 
French press (20,000 psi) and fractionated into insoluble and soluble fractions by 
centrifugation at 12,000 x g for 10 min. The protein concentration of the soluble fraction 
was measured by the Bio-Rad Protein Assay (Bio-Rad, Richmond, CA). Next, 100 ul of 
15 soluble protein diluted to a concentration of 2.5 ug protein/ml in ELISA coating buffer 

(32 mM Na 2 C0 3 /68 mM NaHCOj) was added to 96-well plates. After incubation 
overnight at 4°C, the wells were washed three times with washing buffer (0.5% Tween- 

(8) 

20 in phosphate buffer saline) and three times with ddH 2 0, blocked with 200 u.1 of 2% 
bovine serum albumin (Boehringer Mannheim) in phosphate buffered saline for 1 hour at 

20 37°C, and washed again. Subsequently, 100 Ml/well of BPTI antisera (diluted 1 : 1000 in 

phosphate buffer saline with 0.05% Tween-20® and 0.25% bovine serum albumin) was 
added to the plate and incubated for 1 hr at 37°C. The plate was then washed again as 
before and 100 u.1 of goat anti-rabbit horseradish peroxidase conjugate (diluted 1 : 1000 in 
phosphate buffer saline with 0.05% Tween-20® and 0.25% bovine serum albumin) was 

25 added to each well. After 30 min at 37°C and a final wash, 100 ul of Peroxidase 

Substrate ABTS (Bio-Rad) was added. Developing was stopped with 100 ul 2% oxalic 
acid after 5 min. and the A 410 was measured on a MR300 MicroElisa Reader (Dynatech 
Laboratories Inc., Chantilly, VA). The soluble fraction of cells without plasmid was 
spiked with known amounts of BPTI and used as standards. 

30 
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5.1.1.5 Affinity Precipitation of BPT1 with Trypsin-Agarose 

Soluble fractions from 1.5 ml culture volume were mixed with 1.5 ml of 50 mM 
Tris HC1 (pH 8.0) buffer and 12.5 ^1 trypsin-agarose beads (Sigma, 20 units/ml) and 
incubated on a rotator at 4°C. Subsequently, the beads were resuspended in SDS loading 
buffer, boiled for 5 min, centrifuged, and the soluble fractions were loaded onto 16% 
Tricine SDS-PAGE (Novex, San Diego, CA). Electrophoresis was carried out under 
reducing conditions. The proteins were then transferred to a PVDF membrane for 45 
min at 2.5 mA/cm 2 using the MilliBlot-Graphite Electroblotter System (Millipore, 
Bedford, MA) and immunologically detected with the anti-BPTI primary antibody 
(1:1000 dilution) followed by horseradish peroxidase-conjugated goat anti-rabbit IgG 
(1:3000 dilution) (Bio-Rad, Hercules, CA) (Ausubel et ah, 1989). 

5.1.2 Results 

The gene encoding the complete sequence of mature rat PDI has been fused to the 
bacterial OmpA leader peptide and expressed from the strong lac-lpp promoter (De 
Sutter et ah, 1994). Even in the absence of the inducer IPTG, a 55-kDa band 
corresponding to the mature rPDI is readily visible in SDS-PAGE gels of the osmotic 
shock fraction of E. coli (FIG. 1) and is the only band detected by Western blot analysis 
using rPDI-specific sera. Upon induction with 0.5 mM IPTG, rPDI was overexpressed 
and became the most prominent protein in the periplasmic space. In addition to the intact 
rPDI monomer, a lower molecular weight product, designated rPDIf, was evident in 
induced but not in uninduced cultures (FIG. 1). De Sutter et ah (1994) have shown that 
rPDIf corresponds to a polypeptide synthesized from an internal translation initiation 
codon in the rPDI gene. Although rPDIf is found predominantly in the spheroplast 
fraction, a portion is released by osmotic shock, as is evident from FIG. 1. Western 
blotting of samples from induced cultures also revealed several minor lower molecular 
weight species, presumably degradation products. The level of expression was identical 
in both the dsbA mutant strain JCB571 and in the isogenic control strain JCB570. 
Furthermore, western blot demonstrated that rPDI production was not substantially 
affected by the dsbA, dsbB or dsbAdsbB mutations under either induced or uninduced 
conditions. This is in contrast to other disulfide bond-containing secreted proteins such 
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as alkaline phosphatase, p-lactamase, urokinase and BPTI (Bardwell, 1994; Ostermeier 
and Georgiou, 1994). The production of these proteins is substantially reduced in dsbA 
and dsbB mutants, presumably because inefficient formation of disulfide bonds results in 
increased susceptibility to proteolytic degradation. 

While dsbA is not essential for cell viability, null mutants exhibit pleiotropic 
phenotypes including resistance to filamentous phages^ impaired motility and 
conjugation, poor growth in minimal media and formation of mucoid colonies when 
grown with sub-lethal concentrations of antibiotics (Bardwell, 1994; Bardwell el a/., 
1991). As shown in FIG. 2A and FIG. 2B, transformation with pLPPsOmpArPDI 
resulted in complementation of several dsbA phenotypes. Basal expression of rPDI in 
cells grown without inducer was sufficient to restore conjugation competence and 
sensitivity to fl phage to about 20% and 35% of the level in the parental strain, 
respectively. Expression of rPDI also restored the growth rate of dsbA' cells in minimal 
media to that of dsbA + cells (FIG. 2B). Complementation of the dsbA phenotypes is not 
merely due to the expression of a heterologous secreted protein since it was not observed 
in cells producing preOmpA-BPTI which is also exported in the R coli periplasmic 
space via the OmpA leader peptide and, like rPDI, contains six cysteines. It should be 
noted that in these studies, cells were not induced with JPTG because (a) rPDI was 
already expressed at significant levels without induction and (ii) the overproduction of 
rPDI in induced cultures was found to negatively affect the efficiency of conjugation in 
wild type cells. 

In some genetic backgrounds a null dsbA allege confers sensitivity to DTT 
(Missiakas et al. t 1993). However, the growth of JCB570 and JCB571 were similarly 
affected by the presence of reduced DTT or GSH in both rich and minimal media. Thus, 
in the JCB570 genetic background, it was not possible to determine whether the 
expression of rPDI affects the sensitivity of dsbA' cells to reducing agents. 

The ability of rPDI to complement the phenotypes of dsbA null mutants 
suggested that it must be able to catalyze the formation of disulfide bonds in the 
periplasmic space. Direct evidence for the function of rPDI in vivo was obtained by 
examining the kinetics of oxidation of two bacterial exported proteins, alkaline 
phosphatase and OmpA. Cultures were radiolabeled with 100 uCi/ml Trans 35 S Label 
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for 45 sec and then samples were added to iodoacetamide at different times to 
carboxymethylate free cysteine residues. Subsequently, reduced and oxidized alkaline 
phosphatase were resolved electrophoretically in 20 cm polyacrylamide gels. The 
formation of disulfide bonds in dsbA* cells was very rapid and was largely completed 
within one min. whereas in dsbA' cells, no oxidized alkaline phosphatase was detectable 
even after 10 min (FIG. 3A). However, in cells expressing" b^pal levels of rPDI, the 
formation of disulfide bonds was restored and oxidized alkaline phosphatase was the 
only species detectable after 10 min of chase. Transformation with a control plasmid 
(pTI103) did not have any effect on the oxidation state of alkaline phosphatase. Similar 
results were observed with the oxidation of the outer membrane protein OmpA which 
contains a single disulfide bond in the putative C-terminal periplasmic domain. 

For PDI to be functional as a direct oxidase, its active site must be regenerated 
through disulfide exchange with an appropriate donor/acceptor. Whereas in the ER the 
redox state of PDI is determined by the ratio of reduced to oxidized glutathione, there is 
no evidence for an analogous low molecular weight redox buffer in the periplasmic 
space. In E. coli, the reoxidation of DsbA is thought to be mediated by DsbB, a 
cytoplasmic membrane protein that contains at least four, and possibly five, cysteines 
within two periplasmic exposed loops (Guilhot et ai, 1995; Bardwell et al y 1993; 
Missiakas et al 9 1993; Dailey and Berg, 1993; Jander et aL, 1994). dsbB mutants exhibit 
a defect in disulfide bond formation, though not as severe as dsbA mutants. To 
determine whether the active state of rPDI may also be dependent on DsbB, the oxidation 
of alkaline phosphatase was monitored in dsbB mutaifts transformed with pLPPs 
OmpArPDL In dsbB mutants less than 30% of the alkaline phosphatase was found in the 
oxidized form even after 10 min post-chase (FIG. 3B). Expression of rPDI was largely 
unable to rescue the formation of oxidized alkaline phosphatase as only 50% of the 
alkaline phosphatase was oxidized after ten min. Furthermore, no oxidized protein was 
detected in dsbA dsbB double mutants with or without pLPPsOmpArPDl. Further 
evidence of rPDI's dependence on DsbB came from studying the production of the 
heterologous protein BPTI. As discussed in greater detail below, although rPDI could 
rescue the formation of BPTI in dsbA mutants, rPDI could not rescue the formation of 
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BPTI in dsbB mutants. Thus, the catalysis of disulfide formation by rPDl in E. coli is 
dependent on a functional dsbB gene. 

In dsbB mutants, but not in dsbA mutants or in wild type cells, the expression of 
rPDI appeared to mildly interfere with the processing of the leader peptide as evidenced 
5 by the presence of a band corresponding to the alkaline phosphatase precursor one min 

after the chase. A faint band corresponding to the precursor was also evident even after 
ten min (FIG. 3B). 

Under physiological conditions, the periplasmic space of E. coli is rather poor in 
disulfide isomerase activity, a function that is thought to be mediated primarily by DsbC 
1 0 (Bardwell, 1994). Since a major role of PDI in the endoplasmic reticulum appears to be 

the catalysis of disulfide bond isomerization (Bardwell and Beckwith, 1993; Wittrup, 
1995), it was reasoned that the presence of rPDl in the E. coli periplasm may facilitate 
the expression of heterologous proteins whose folding requires the rearrangement of 
disulfide bonds. The rate limiting step in the in vitro folding of BPTI (Creighton, 1992; 
Weissman and Kim, 1992; Goldenberg, 1992). In vitro, the presence of PDI modestly 
increases the rate of formation of two disulfide intermediates but greatly increases their 
rate of intramolecular rearrangement (Weissman and Kim, 1993) and possibly direct 
oxidation (Creighton et ai, 1980). Expression of secreted BPTI in R coli results in low 
levels of native protein and is accompanied by the accumulation of two disulfide 
intermediates in the periplasmic space (Ostermeier and Georgiou, 1994). To measure the 
effect of rPDI on BPTI expression, cells were co-transformed with pLPPsOmpArPDI and 
pACYCBPTI, a compatible plasmid carrying the BPTI gene. The cells were grown in 
minimal media supplemented with chloramphenicol and ampicillin to maintain both 
plasmids. Because the standard assay for BPTI, which is based on trypsin inhibition, is 
not very sensitive and suffers in part from interference from endogenous proteases and 
trypsin inhibitors, BPTI was quantified by ELISA using a primary polyclonal antibody 
raised against native BPTI. 

Coexpression of rPDI in wild type cells resulted in a six fold increase in BPTI in 
the absence of glutathione and a fifteen fold increase in its presence (FIG. 4). The 
increased yield with rPDI coexpression was not due to a higher rate of BPTI synthesis. If 
anything, co-expression of rPDI resulted in a slightly lower rate of BPTI synthesis as 
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determined by radiolabeling studies. This slight reduction in the rate of protein synthesis 
cannot account for the increased efficiency of native BPTI formation since in the absence 
of rPDI, BPTI production was not found to improve in cells where the synthesis of BPTI 
was reduced by lowering the amount of inducer, 1PTG. 

To confirm that the increased level of BPTI detected by ELISA was due to the 
production of native protein, BPTI was affinity precipitated wit^lrypsin immobilized on 
agarose beads and detected by Western blotting. The interaction between trypsin and 
native BPTI is exceedingly strong (dissociation constant 6 x 10" M M) and the complex is 
stable for weeks at 4°C (Vincent and Lazdunski, 1972). Reduced, carboxymethylated 
BPTI does not bind to trypsin. Coomassie staining of trypsin-precipitated samples from 
wild type cells not bearing any plasmid detected a band at approximately 16-kDa, the 
molecular weight of the E. coli trypsin inhibitor ecotin which has no homology with 
BPTI (McGrath et al. y 1991). Several other faint bands were also visible, but none of 
these bands crossreacted with anti-BPTI sera on Western blots (FIG. 5A, lane 2). When 
such E. coli extracts were spiked with high levels of purified BPTI and trypsin affinity 
precipitated more than one band was detected by Western blotting (FIG. 5A and FIG, 
5B). At lower protein loading, however, only a single band was visible. 

Western blots showing the level of BPTI in wild type, dsbA and dsbB cultures, 
with or without co-expression of rPDI, and in the presence of various amounts of reduced 
or oxidized glutathione are shown in FIG. 5 A and FIG. 5B. These studies confirmed that 
coexpression of rPDI increases the level of native BPTI production several fold and that 
production could be further enhanced by supplementing th^ growth media with moderate 
amounts of GSH. In the absence of rPDI co-expression, glutathione alone did not 
increase the production of BPTI. A modest increase was somewhat variable. The 
presence of high concentrations of GSH (25 mM) was found to adversely affect the 
production of BPTI both with and without rPDI co-expression. 

Cells lacking a functional DsbA or DsbB were found to be completely impaired 
in BPTI production, a deficiency which could not be alleviated by the addition of 
reduced or oxidized glutathione or a mixture thereof (FIG. 5B). Although exogenous 
oxidized glutathione can partially oxidize DsbA and thus complement some of the 
phenotypes of dsbB mutants, in this case it was not sufficient to rescue BPTI production. 
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Coexpression of rPDI was able to restore BPT1 production in dsbA mutants but not dsbB 
mutants, further illustrating that rPDl complements the dsbA mutation and that rPDPs 
oxidase activity is dependent on a functional DsbB protein. In dsbB mutants, BPT1 was 
detected only in cells coexpressing rPDI and supplemented with 10 mM oxidized 
glutathione. The addition of 10 mM cystamine to cultures expressing rPDI could also 
complement the dsbB mutation. Since PDI has been shownJo have a specificity for 
glutathione in forming the mixed disulfide in dithiol mediated oxidation (Darby et al, 
1994), these results suggest that the likely role of the added dithiols is to oxidize the 
active site of rPDI which then carries out direct oxidation of the protein substrate. 
Finally, is should be mentioned that 10 mM oxidized DTT, which is a much weaker 
oxidizer that GSSG or cystamine, could not rescue BPTI dsbB mutants coexpressing 
rPDI. 

In mutant cells, particularly dsbB mutants, the induction of rPDI resulted in the 
accumulation of two higher molecular weight species, one of which migrated with an 
electrophoretic mobility identical to the BPTI precursor. Both of these species were 
recognized by an antiserum against BPTI obtained from a different laboratory and are 
unlikely to represent crossreacting E. coli proteins. As was discussed above, the 
expression of rPDI in dsbB mutants results in some retardation of precursor processing. 
Therefore, it is possible that the higher molecular weight species detected in the dsbB 
mutants where rPDI was overproduced correspond to preOmpA-BPTI species. Such 
preOmpA-BPTI species must contain at least some of the native disulfides since 
otherwise they could not have been bound by the immobilized trypsin. 

To elucidate the role of PDI in the folding of BPTI, the kinetics of folding were 
monitored in pulse chase studies where folding intermediates were trapped by blocking 
free cysteines with iodoacetamide and separated by non-reducing gel electrophoresis. In 
the electrophoretic system used here, all two-disulfide intermediates (designated as *) are 
well resolved from the native protein and from other folding species (Ostermeier and 
Georgiou, 1994). Without coexpressing rPDI, these two disulfide intermediates 
accumulate during folding, and the rate limiting step in the formation of native protein is 
their isomerization. In cells expressing rPDI, two disulfide intermediates were still 
observed to accumulate (FIG. 6). Furthermore, when reduced glutathione is added to the 
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cells two disulfide intermediates accumulated to an even greater extent. In this case over 
50% of the BPT1 is found as two-disulfidc intermediate species after 20 min of chase. It 
appears that under these conditions rPDl in the E. coli periplasm did not have a 
noticeable effect on the rearrangement and/or direct oxidation of kinetically trapped 
5 two-disulfide intermediates. The exact fate of the * band is not known at present, but it 

must be degraded and/or eventually fold to native protein. W]£n lysates of wild type 
cells or dshA mutants expressing rPDI (which had been labeled with 14 C-L-arnino acids 
for 5 hours and protein free cysteines blocked with 100 mM iodoacetamide) were 
immunoprecipitated and resolved on non-reducing Reisfeld-Urea gels, only native BPTI 

10 with three disulfides was found to be present. 

In cells co-expressing rPDI the half-life for the formation of the native protein 
was approximately 6-7 min independent of whether GSH/GSSG had been added. This is 
experimentally indistinguishable from the rate of BPTI folding without rPDI (Ostermeier 
and Georgiou, 1994) and consistent with rPDI not having an effect on the rate limiting 

15 step: disulfide isomerization. For comparison, the folding of BPTI in eukaryotic 

microsomes, which are rich in PDI, can occur with a half life of less than one min at 
30°C and is accompanied by relatively little accumulation of two-disulfide intermediates, 
depending on the redox conditions employed (Creighton et ai 9 1993). 

Rat protein disulfide isomerase is expressed at a high level in the E, coli 

20 periplasm even in dsbA mutants. Besides restoring normal growth in minimal media, 

expression of rPDI restores conjugation competence and sensitivity to filamentous 
phages. These phenotypes are dependent on the presence of correctly assembled F pili, a 
process which is impaired when disulfide bond formation is compromised in dsbA 
mutants. Ostensibly, rPDI restores pili assembly by facilitating disulfide bond formation. 

25 Indeed, rPDI could catalyze the oxidation of native E. coli proteins such as alkaline 

phosphatase and OmpA in cells lacking DsbA, albeit at a rate somewhat slower than 
DsbA's. 

Although these results strongly suggest that rPDI functions as a cysteine oxidase 
in the periplasmic space of gram-negative bacteria, another explanation is that rPDI does 
30 not function catalytically, but instead it somehow induces the synthesis of other E. coli 

proteins that are responsible for disulfide bond formation. For example, Missiakas et ai 
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(1994) have shown that overexpression of DsbC can complement dsbA mutations and it 
may be possible that the phenomenon observed arose indirectly due to the induction of 
DsbC. This is not the case for the following reasons: (a) The oxidation of alkaline 
phosphatase in dsbA mutants expressing rPDI is dependent on DsbB. However, high 
levels of DsbC in fact complement dsbB mutations (Missiakas et ai, 1994). Thus, the 
function of rPDI cannot merely be due to the induction .of DshC since then it would not 
be expected to be dsbB dependent, (ii) As shown in FIG. 5A and FIG. 5B and discussed 
further below, expression of rPDI is essential for the formation of native BPTI in dsbA 
mutants. In the absence of rPDI, no BPTI is formed in dsbA mutants even when the cells 
are grown with a wide range of concentrations of exogenous thiols and disulfides. Thus 
it cannot be argued that rPDI simply changes the redox state of the periplasm; rather its 
enzymatic activity per se catalyzes the formation of disulfide bonds in BPTI. 

Given the presence of millimolar concentrations of glutathione in the ER (Hwang 
et ai, 5992) and PDI's specificity for glutathione (Darby et ai, 1994) it appears that in 
eukaryotes the active stale of PDI is maintained by glutathione. Since there is no 
evidence for the presence of glutathione or other low molecular weight thiols in the 
bacterial periplasmic space (Wolfing and Pluckthun, 1994), it is reasonable to assume 
that for rPDI to be functional in the periplasm, it must be able to interact directly with a 
component of the prokaryotic disulfide forming machinery. Indeed, the rPDI-mediated 
20 formation of disulfide bonds in alkaline phosphatase and BPTI was found to be 

dependent on DsbB. This is interesting given that rPDI, apart from its thioredoxin active 
site, shows little homology to DsbA or to other bacterial proteins. It may be that the 
active site of rPDI, whose three dimensional structure has not yet been solved, conforms 
to the thioredoxin fold as is the case with DsbA. If the active sites of the two proteins are 
structurally similar, then it is reasonable to expect that a protein such as DsbB, which 
normally interacts with DsbA, may also be able to interact with rPDI. Alternatively, 
rPDI's dependence on DsbB may be an indirect effect. 

In catalytic amounts and in the presence of glutathione or other small molecular 
weight thiols, PDI primarily catalyzes the formation of mixed disulfides with model 
peptide substrates (Darby et al, 1994). However, the lack of a periplasmic low 
molecular weight redox couple in bacteria and rPDI's dependence on DsbB implies that 
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rPDI catalyzes disulfide bond formation by direct transfer of its own disulfide bonds to 
protein substrates. Direct formation of disulfide bonds form PD1 to reduced proteins has 
been observed in vitro with stoichiometric quantities of the enzyme and in the absence of 
other oxidants (Darby et aL, 1994; Zapun and Creighton, 1994). It should be noted that 
DsbA also normally transfers its disulfide bond directly to reduced protein substrates 
(Zapun and Creighton, 1994), but unlike with PDI, this reactions not enhanced by the 
addition of glutathione redox buffers (Joly and Swartz, 1994). 

Gram-negative bacterial proteins fold faster under very oxidizing conditions. For 
example, the optimal rate for folding of alkaline phosphatase in vitro occurs in 6 raM 
GSSG and proceeds at 50% the maximal rate in 30 mM GSSG (Walker and Gilbert, 
1994). Accordingly, there is evidence that the periplasmic space is indeed highly 
oxidizing environment. Most, if not all, of the DsbA molecules have been shown to be 
in the oxidized form in the periplasm of wild type cells (Kishigami et ai t 1995). The 
highly oxidized state of the periplasm can in part explain why eukaryotic proteins 
containing multiple disulfides are often poorly expressed. In the endoplasmic reticulum, 
these proteins normally fold in a relatively reduced environment which affords the 
opportunity for reduction of incorrect disulfides and disulfide rearrangement, processes 
catalyzed by PDI (Freedman et aL> 1994; Wittrup, 1995). Disulfide bond isomerization 
is likely to be relatively unimportant to E. coli as periplasmic and outer membrane 
proteins with more than two disulfides are rare (Joly and Swartz, 1994). 

For rPDI to be efficient in the oxidation of alkaline phosphatase in dsbA mutants, 
it must provide a redox environment comparable to that afforded by DsbA, an equivalent 
[GSH] 2 /[GSSG] equilibrium constant of around 20 f±M (Walker and Gilbert, 1994). The 
redox potential of PDI at near physiological pH is equivalent to [GSH] 2 /[GSSG] of 
around 40-80 j-iM. Thus, a significant fraction, if not the majority, of the rPDI molecules 
in the periplasm must be present in the oxidized form in order to effectively facilitate 
disulfide formation. This is quite different from the endoplasmic reticulum where the 
redox state is believed to be [GSH] 2 /[GSSG] = 0.5-3.3 mM and PDI would be present 
almost exclusively in reduced form (Hwang et al, 1992). 

To examine whether rPDI can catalyze disulfide bond isomerization in the 
periplasm, the oxidative folding of BPT1 was monitored. BPTI is a three disulfide 
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protease inhibitor which is very poorly expressed in E. coli and whose in vitro folding 
pathway involves disulfide rearrangement as shown in Scheme 1 (Goldenberg. 1 992). - 
PD1 has been shown to catalyze virtually all of the steps in the in vitro folding 
pathway of BPT1 (Weissman and Kim, 1993; Creighton et al, 1980). In the presence of 
catalytic amounts of protein disulfide isomerase and a suitable redox buffer, the rate of 
formation of N' and N* from reduced protein is increased by about three-fold whereas 
the subsequent folding of these two kinetically trapped intermediates to N is accelerated 
by more than 3,000 fold (Weissman and Kim, 1993) resulting in a dramatic decrease in 
the amount of two disulfide intermediates observed during folding. Catalysis also 
appears to occur in -vivo as evidenced by the fact that relatively small amounts of two 
disulfide intermediates were detected during the folding of BPTI in microsomes 
(Creighton et al., 1993). 

Co-expression of rPDI in wild type E. coli increased the steady-state level of 
BPTI by fifteen fold in the presence of glutathione and six fold in its absence. However, 
1 5 pulse chase studies revealed that the presence of rPDI does not decrease the accumulation 

of two-disulfide intermediates. In fact, in the presence of glutathione, two disulfide 
intermediates accumulated to a greater extent. Thus, in wild type E. coli, rPDI does not 
facilitate the isomerization of N' and N* to N SH s " anymore than the formation of N' and 
N*. 

In wild type cells, rPDI does not function as an appreciable isomerase or direct 
oxidant of the final disulfide in the two disulfide intermediates, the rate limiting step in 
folding. rPDI's apparent lack of isomerase activity in the periplasm is not surprising 
since evidence suggests that the active sites of rPDI in the periplasm are predominantly 
oxidized and therefore can only catalyze direct oxidation and not disulfide 
rearrangement. Attempts to improve BPTI production in vivo by making the periplasm 
more reducing (/.*., by adding reducing agents or by using dsb mutants) in order to elicit 
rPDI isomerase activity were unsuccessful, but should not be construed as evidence of an 
inability of rPDI to exhibit isomerase activity in the periplasm. In a more reducing 
periplasm, the rates of formation of the first and second disulfides in BPTI should 
decrease and thus the competing process of proteolysis is likely to limit the yield of 
correctly folded protein. It may be that the conditions for eliciting rPDI's isomerase 
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activity leave the periplasm too reducing for any BPTI molecules to efficiently form 
disulfides in order to avoid proteolysis. 

The increase in two disulfide intermediates with added glutathione indicates that 
rPDl is functioning as an oxidase that supplements DsbA in forming the two disulfide 
5 intermediates. In v/7ro, PDI's oxidase activity has been found to be enhanced by the 

addition of a redox buffer of glutathione (Darby et al % 1994). ^ 

Increased steady state levels of BPTI appear to be the result of the faster 
formation of two disulfide intermediates from reduced protein thus avoiding proteolysis. 
It is also conceivable that the chaperon-like activity of PDI may play a role in the 
10 increased levels of BPTI (Puig and Gilbert, 1994; Cai et al, 1994). Studies have 

determine the effect of PDI mutants with (a) only chaperon activity and (b) primarily 
isomerase activity on the folding of BPTI in order to further elucidate the mechanism by 
which its co-cxpression facilitates the production of multi-disulfide containing proteins. 

5.2 Example 2 — Methods for Expression of tPA in Bacterial Cells 

This example describes the production of soluble, active, secreted tPA in bacterial 
host cells by the co-expression of rPDI. Remarkably, rPDl coexpressed with rtPA 
significantly increases yields of tPA in the bacterial host cell. 

Strains RB791 and UT5600 were co-transformed with a pACYC184 derivative 
vector expressing tPA from the phoA promoter and with a pBR322 recombinant vector 
carrying a transcriptional unit comprising the PDI gene expressed downstream from 
either the lac-lpp or the P BAD promoter (Guzman et al, 1^95). The latter construct was 
designated pBAD-Stll-tPA. 

Expression of the two proteins was induced by the addition of inducers, IPTG, 
and arabinose, as appropriate. The cells were harvested after overnight growth and the 
level of tPA was determined by three methods. 

In the first method, a colorimetric assay was used in which the rate of plasmin 
formation from plasminogen was measured (American Diagnostics, Inc.). The second 
method was the fibrin plate assay, in which tPA-containing cell extracts were spotted on 
agar plates containing fibrin and plasminogen and the zone of clearance was determined. 
Finally, in the third method, Western blots were performed using anti-tPA antibodies. 
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Active tPA was detected by both the fibrin plate assay and by the plasmin 
activation assay in cells co-expressing PDI. The level of active tPA protein was between 
about 5 and about 12 ug/L/OD 600nm units of culture, depending upon the strain 
background and particular growth conditions used. Importantly, practically no active 
tPA could be detected in cultures that did not co-express PDI. The co-expression of PDI 
increased the amount of soluble, active tPA, but did not affect the total amount (soluble + 
insoluble) detected by Western blotting. 

5.3 Example 3 - Cloning and Expression of the Yeast PDI in E. coli 

The mature yeast PDI sequence was cloned by PCR™ amplification of the gene 
from an 5". cerevisiae cDNA library (obtained from Professor K.D. Wittrup, Department 
of Chemical Engineering, University of Illinois, Urbana-Champaign) using the following 
primers: 

GG1 8: 5'ATATGAATTCTGGTTTTCGCCCAACAAGAAGCTGTGGCC-3' (SEQ. ID 
15 NO.:l) 

and 

GG28: 5-'GGACGGAGGATCCTTACAATTCATGGTG-3* (SEQ. ID: 2). 

The amplified DNA product contains a XbaJ and BamHl restriction sites allowing 
the insertion of the gene into the expression vector pJG105 (Grrayeb et ah, 1984). The 
20 resulting plasmid was designated pLpptoc-YPDI-I. In pLpp/oc-YPDI-1 the yeast PDI 

gene is fused in frame to the OmpA leader peptide. The ompA-ypdi gene is downstream 
from a strong ribosomal binding site and the IPTG-inducible Ipp-lac promoter. pLpplac- 
YPDI-1 was transformed into a variety of E. coli strains including RB791, RI89 and 
Rl 90 (Bard well et ai, 1 991 ). Induction of ypdi syntheses by I PTG resulted in 
25 somewhat slower growth. A band corresponding to the full length yeast PDI was 

detected by Western blotting using a polyclonal antiserum raised against the yeast PDI. 

The pLpp/oc-YPDI-1 plasmid and pJG105 as a control were transformed in the 
dsbA* and dsbA' strains Rl 89 and Rl 90, respectively (FIG. 8). dsbA mutant cells do not 
express alkaline phosphatase activity. Expression of the rat PDI in E. coli restores 
30 alkaline phosphatase activity in dsbA' mutants and a similar effect was seen in cells 

transformed with pLpp/ac-YPDI-I and inducted with IPTG. Thus, the yeast PDI is 
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functional when expressed in bacteria and can complement the defect in dshA' cells. In 
separate studies, RB791 cells were transformed with pACYCBPTI and pLpplac-YPDl-1 . 
Induction of BPTI and yPDI with 0.5 MM IPTG resulted in f-fold higher specific BPTI 
levels (per mg of total soluble protein) compared to cells that did not coexpress the yeast 
5 PD1. Importantly, unlike the case with rate PDI, the effect of years PDI could not be 

improved further by the additional of reductants such as gltitathii^le. 
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ClA J M S 

A process for producing in a bacterial cell, a biologically-active, soluble 
eukaryotic polypeptide having at least about three disulfide bonds, comprising 
expressing in said cell a first DNA segment encoding a disulfide isomerase 
operably linked to a signal sequence and a second DNA segment encoding 
said eukaryotic polypeptide operably linked" to " a|kignai sequence under 
conditions effective to produce said eukaryotic polypeptide. 

The process according to claim 1 , wherein said eukaryotic polypeptide is a 
mammalian polypeptide. 



The process according to any preceding claim, wherein said eukaryotic 
polypeptide is a human or bovine polypeptide. 

The process according to any preceding claim, wherein said eukaryotic 
polypeptide is a tissue plasminogen activator or pancreatic trypsin inhibitor. 

The process according to any preceding claim, wherein said disulfide 
isomerase is protein disulfide isomerase. 

The process according to claim 5, wherein said protein disulfide isomerase is a 
rat, yeast, or human protein disulfide isomerase. * 

The process according to any preceding claim, wherein said eukaryotic 
polypeptide comprises at least seven disulfide bonds. 

The process according to claim 7, wherein said eukaryotic polypeptide 
comprises at least twelve disulfide bonds. 

The process according to claim 8, wherein said eukaryotic polypeptide 
comprises at least fourteen disulfide bonds. 
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10. The process according to claim 9, wherein said eukaryotic polypeptide 
comprises at least seventeen disulfide bonds. 

5 11. The process according to claim 10, wherein said eukaryotic polypeptide 
comprises seventeen disulfide bonds. " 

12. The process according to any preceding claim, wherein said signal sequence is 
selected from the group consisting of OmpA, LamB, StII, MalE, Lpp, and 

10 PelB. 

13. The process according to claim 12, wherein said signal sequence is an OmpA 
signal sequence. 

15 14. The process according to any preceding claim, wherein said first and said 
second DNA segments are expressed from a promoter selected from the group 
consisting of lac-lpp, Ipp, trc. tac, T7, Pkad* P^oA and X VL . 

15. The process according to claim 14, wherein said first and said second DNA 
20 segments are expressed from a lac-lpp promoter. 

16. The process according to claim 15, wherein said first DNA segment -is 
expressed by pLPPsOmpArPDI. 

25 17. The process according to claim 15, wherein said second DNA segment is 
expressed by pTPA177 or pACYCBPTI. 

18. The process according to claim 1, wherein said bacterial cell is cultured in a 
medium comprising one or more reducing agents selected from the group 
30 consisting of glutathione, cysteine, cystamine, thioglycollate, dithiothreitol 

and dithioerythritol. 

-98- 



sIS DOC ID: <WO 8738123A1 I > 



WO 97/38123 



PCT/US9 7/05636 



10 



19. The process according lo any preceding claim, wherein said bacterial cell is an- 
Enterobacteriaceae cell. 

20. The process according to any preceding claim, wherein said bacterial cell is an 
Escherichia or Salmonella spp. cell. 

21 . The process according to any preceding claim, wherein said bacterial cell is an 
E. coli cell. 

22. The process according to any preceding claim, wherein said E. coli cell is 
selected from the group consisting of ATCC XXXXX, SF103, SF110, 
UT560O and RB7911. 



15 23. The process according to any preceding claim, wherein said eukaryotic 
polypeptide is secreted to the periplasm or to the outer membrane of said 
bacterial cell. 

24. The process according to any preceding claim, wherein said eukaryotic 
20 polypeptide is isolatable from a culture supernatant or a soluble fraction of 

said bacterial cell. 

25. The process according to any preceding claim, wherein said eukaryotic 
polypeptide produced in said bacterial cell has a specific activity equal to or 

25 greater than the specific activity of said polypeptide when produced in a 

eukaryotic host cell. 

26. The process according to any preceding claim, wherein said eukaryotic 
polypeptide is a tissue plasminogen activator protein having a specific activity 

30 of at ,ea st about 5 to about 1 2 ug/l/OD 600nm of culture. 
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27. The process according to any preceding claim, wherein said eukaryotic 
polypeptide is a recombinant polypeptide. 

28. The process according to any preceding claim, wherein said eukaryotic 
5 polypeptide produced in said bacterial host assumes a conformation 

substantially identical to the conformation assumed bjqpaid polypeptide when 
produced in a eukaryotic host cell. 

29. An expression system for producing in a bacterial cell, a biologically-active, 
10 soluble eukaryotic polypeptide, said system comprising a first DNA segment 

and a second DNA segment, wherein said first segment encodes a disulfide 
isomerasc and said second segment encodes a eukaryotic polypeptide having 
at least about three disulfide bonds. 

15 30. The expression system according to claim 29, wherein said eukaryotic 
polypeptide is a mammalian polypeptide. 

3 1 . The expression system according to either claim 29 or claim 30, wherein said 
eukaryotic polypeptide is a human or bovine polypeptide. 

20 

32. The expression system according to any of claims 29 to 31, wherein said 
eukaryotic polypeptide is a tissue plasminogen aeiivator or pancreatic trypsin 
inhibitor. 

25 33. The expression system according to any of claims 29 to 32, wherein said 
disulfide isomerase is protein disulfide isomerase. 

34. The expression system according to any of claims 29 to 33 ? wherein said 
disulfide isomerase is a rat, yeast, or human protein disulfide isomerase. 

30 
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35. The expression system according to any of claims 29 to 34, wherein said 
eukaryotic polypeptide comprises at least seven disulfide bonds. 

36. The expression system according claim 35, wherein said eukaryotic 
5 polypeptide comprises at least twelve disulfide bonds. 

37. The expression system according to claim 36, "wherein said eukaryotic 
polypeptide comprises at least fourteen disulfide bonds. 

10 38. The expression system according to claim 37, wherein said eukaryotic 
polypeptide comprises at least seventeen disulfide bonds. 

39. The expression system according to claim 38, wherein said eukaryotic 
polypeptide comprises seventeen disulfide bonds. 



15 



20 



40. The expression system according to any of claims 29 to 39, wherein said first 
DNA segment or said second DNA segment further comprises a signal 
sequence. 

41. The expression system according to claim 40, wherein said signal sequence is 
selected from the group consisting of OmpA, LamB, StII, MalE, Lpp, and 
PelB^ 



42. The expression system according to claim 41, wherein said signal sequence is 
25 an OmpA signal sequence. 

43. The expression system according to any of claims 29 to 42, wherein said first 
DNA segment and said second DNA segment are expressed from a promoter 
selected from the group consisting of lac-lpp, ipp, trc f tac, T7, P BAD ,phoA and 

30 \ PL . 
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44. The expression system according to claim 43, wherein said first DN A segment 
or said second DNA segment is expressed from a Iac-ipp promoter. 

45. The expression system according to any of claims 29 to 44, wherein said first 
5 DNA segment is expressed by pLPPsOmpArPDl. 

46. The expression system according to any of claims 29 to 45, wherein said 
second DNA segment is expressed by pTPA177 or pACYCBPTL 

10 47. The expression system according to any of claims 29 to 46, wherein said 
bacterial cell is an Enterobacteriaceae cell. 

48. The expression system according to any of claims 29 to 46, wherein said 
bacterial cell is an Escherichia or Salmonella spp. cell. 

15 

49. The expression system according to any of claims 29 to 46, wherein said 
bacterial cell is an E. coli cell. 

50. The expression system according to claim 49, wherein said bacterial cell is an 
20 E. coli ATCC XXXXX, SF103, SF1 10, UT5600 or RB791 cell. 

51. The expression system according to any of claims 29 to 50, wherein said 
eukaryotic polypeptide is secreted to the periplasm or to the outer membrane 
of said bacterial cell. 

25 

52. The expression system according to any of claims 29 to 51, wherein said 
eukaryotic polypeptide is isolatable from a culture supernatant or a soluble 
fraction of said bacterial cell. 

30 53. The expression system according to any of claims 29 to 52, wherein said 
eukaryotic polypeptide produced in said bacterial cell has a specific activity 
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equal to or greater than the specific activity of said polypeptide when produced 
in a eukaryotic host cell. 

54. The expression system according to any of claims 29 to 53. wherein said 
eukaryotic polypeptide has a specific activity of at least about 1 to about 1000 
|^g/l/OD 600nm of culture. 

55. The expression system according to claim 54, wherein said eukaryotic 
polypeptide has a specific activity of at least about 5 to about 500 
ug/l/OD 600nm of culture. 

56. The expression system according to claim 55, wherein said eukaryotic 
polypeptide has a specific activity of at least about 10 to about 100 
u.g/l/OD 600nill of culture. 

57. The expression system according to any of claims 29 to 56, wherein said 
eukaryotic polypeptide is a tissue plasminogen activator protein having a 
specific activity of at least about 5 to about 12 ng/I/OD 600nm units of culture. 

58. The expression system according to any of claims 29 to 57, wherein said 
eukaryotic polypeptide is a recombinant polypeptide. 

59. The expression system according to any of claims 29 to 58, wherein said 
eukaryotic polypeptide produced in said bacterial host has a conformation 
substantially identical to the conformation said polypeptide has when 
produced in a eukaryotic host cell. 

60. The expression system according to any of claims 29 to 59, wherein said first 
DNA segment and said second DNA segment are contained within a single 
recombinant vector. 
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61. The expression system according to any of claims 29 to 59, wherein said first 
DNA segment is contained within a first recombinant vector and said second 
DNA segment is contained within a second recombinant vector. 

5 62. The expression system according to claim 61 /wherein said first and said 
second recombinant vectors are capable of being -cq^cpressed in a single 
bacterial cell. 

63. The expression system according to claim 62, wherein said first recombinant 
10 vector is pLPPsOmpArPDI and said second recombinant vector is selected 

from the group consisting of pACYCBPTI and pTPA177. 

64. A recombinant vector comprising a first transcriptional unit encoding a 
mammalian protein disulfide isomerase operably linked to a first signal 

15 sequence and a second transcriptional unit encoding a mammalian polypeptide 

having at least about three disulfide bonds operably linked to a second signal 
sequence. 

65. The recombinant vector according to claim 64, wherein said mammalian 
20 polypeptide is a human or bovine polypeptide. 

66. The recombinant vector according to claim 64 or 65, wherein said mammalian 
polypeptide is a tissue plasminogen activator or pancreatic trypsin inhibitor. 

25 67. The recombinant vector according to any of claims 64 to 66, wherein said 
protein disulfide isomerase is a rat, yeast, or human protein disulfide 
isomerase. 

68. The recombinant vector according to any of claims 64 to 67, wherein said 
30 mammalian polypeptide comprises at least seven disulfide bonds. 
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69. The recombinant vector according to any of claims 64 to 68. wherein said 
mammalian polypeptide comprises at least twelve disulfide bonds. 

70. The recombinant vector according to any of claims 64 to 69, wherein said 
mammalian polypeptide comprises at least fourteen disulfide bonds. 

71. The recombinant vector according to any of claims^64 to 70, wherein said 
mammalian polypeptide comprises at least seventeen disulfide bonds. 

72. The recombinant vector according to any of claims 64 to 71, wherein said 
mammalian polypeptide comprises seventeen disulfide bonds. 

73. The recombinant vector according to any of claims 64 to 72, wherein said first 
transcriptional unit or said second transcriptional unit further comprises a 

15 signal sequence. 

74. The recombinant vector according to claim 73, wherein said signal sequence is 
selected from the group consisting of OmpA, LamB, StII, MalE, Lpp, and 



10 



20 



25 



PelB. 



75. 



The recombinant vector according to claim 74, wherein said signal sequence is 
an OmpA signal sequence. 



76. The recombinant vector according to any of claims 64 to 75, wherein said first 
transcriptional unit or said second transcriptional unit further comprises a 
promoter selected from the group consisting oflac-lpp, lpp, trc, tac^ T7, P BAD , 
phoA and X. PL . 

77. The recombinant vector according to claim 76, wherein said promoter is a lac- 
30 lpp promoter. 
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78. The recombinant vector according to any of claims 64 to 77. wherein said 
vector comprises a gene encoding rat or yeast PDI and a gene encoding human 
tissue plasminogen activator or bovine pancreatic trypsin inhibitor. 

5 79. The recombinant vector according to any of claims 64 to 78, wherein said 
vector is capable of expression in a Salmonella spp. eeH^br an Escherichia coli 
cell selected from the group consisting of E. coli ATCC XXXXX, SF103, 
SF 1 1 0, UT5600 and RB79 1 . 

10 80. The recombinant vector according to any of claims 64 to 79, wherein said first 
and said second transcriptional units are expressed from the same promoter or 
from different promoters. 

81. A host cell transformed with the recombinant vector according to any of 
15 claims 64 to 80. 

82. A composition comprising a biologically-active, soluble, recombinant tissue 
plasminogen activator protein or peptide operably linked to a bacterial export 
signal peptide. 

20 

83. The composition according to claim 82, wherein said tissue plasminogen 
activator is mammalian tissue plasminogen activator. 

84. The composition according to claim 82 or 83, wherein said tissue plasminogen 
25 activator is human tissue plasminogen activator. 

85. The composition according to any of claims 82 to 84, wherein said bacterial 
export signal peptide is selected from the group consisting of OmpA, LamB, 
StII, MalE, Lpp, and PelB. 

30 
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86. The composition according to any of claims 82 to 85, wherein said 
recombinant tissue plasminogen activator protein is encoded by a DNA- 
segment positioned under the control of a promoter selected from the group 
consisting of lac-lpp, Ipp, trc. tac, T7, P RAD , pho.4 and A Pl . 

87. The composition according to any of claims 82 to 86, wherein said tissue 
plasminogen activator protein has a specific activity^ of at least about 1 to 
about 1000 ug/L/ODsoo,,,,, of culture. 



10 88. The composition according to any of claims 82 to 87, wherein said tissue 
plasminogen activator protein has a specific activity of at least about 5 to 
about 500 ng/L/OD 600nm of culture. 

89. The composition according to any of claims 82 to 88, wherein said tissue 
15 plasminogen activator protein has a specific activity of at least about 10 to 

about 100 ug/L/OD 600nm of culture. 

90. A composition comprising a biologically-active, soluble, recombinant 
pancreatic trypsin inhibitor protein or peptide operably linked to a bacterial 

20 export signal peptide. 

91. The composition according to claim 90, wherein said pancreatic trypsin 
inhibitor is mammalian pancreatic trypsin inhibitor. 



25 92. 



The composition according to claim 90 or 91 , wherein said tissue plasminogen 
activator is human or bovine pancreatic trypsin inhibitor. 



93. The composition according to any of claims 90 to 92, wherein said bacterial 
export signal peptide is selected from the group consisting of OmpA, LamB, 
30 StII, MalE, Lpp, and PelB. 
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94. The composition according to any of claims 90 to 93, wherein said pancreatic 
trypsin inhibitor protein is encoded by a DNA segment positioned under the 
control of* a promoter selected from the group consisting of lac-lpp^ Ipp, trc % 
tac % T7, P B ad> phoA and X PL . 

5 

95. The composition according to any of claims 90 to* 94; A^ferein said pancreatic 
trypsin inhibitor protein has a specific activity of at least about 1 to about 1 000 
Hg/L/OD 600nm of culture. 

10 96. The composition according to any of claims 90 to 95, wherein said pancreatic 
trypsin inhibitor protein has a specific activity of at least about 5 to about 500 
|Ag/L/OD 600nm of culture. 

97. The composition according to any of claims 90 to 96, wherein said pancreatic 
1 5 trypsin inhibitor protein has a specific activity of at least about 10 to about 100 

^g/L/OD 600nm of culture. 
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